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(57) ABSTRACT 

A system and method classify a pixel of image data as one 
of a plurality of image types. A first image characteristic 
value for the pixel, a second image characteristic value for 
the pixel, a third image characteristic value for the pixel, and 
a fourth image characteristic for the pixel is determined. 
Some of these determinations may be resolution dependent. 
The values from these determination are utilized in assign- 
ing an image type classification to the pixel. An auto- 
segmentation circuit enables cjassifying_a__ pixel as a con - 
tirmGii5tnn^ an'^ ^ptprting a gt;adi ent in the window of pixels 
oFwhich the pixel is a member. A filter enh an cesjthe.image 
data of-tbe-pixel-tone-wfaeD"Xgradi ent indicating that imag e _ 
data^within-lhejyindow-<)f-pixels1s^anging at a rate greater 
than a predetermined threshold is detected or lowpasses the 
image data of the pixel tone when a gradient indicating that 
image data within the window of pixels is changing at a rate 
less than a predetermined threshold is detected. 
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METHOD AND SYSTEM FOR CLASSIFYING of the image frequcDcy characteristics, without prior thresh- 

AND PROCESSING OF PIXELS OF IMAGE elding. Valleys in the resulting auto-correlated function are 

DATA detected to determine whether a high frequency halftone 

image is present. 

This application is a continuation-in-part of application 5 An example of a "prior art" automatic segmentation 

Sen No. 09/010^72, filed on Jan. 21, 1998 in the name of circuit is illustrated in FIG. 6. The basic system as shown in 

Jeng-Nan Shiau now U.S. Pal. No. 6,185328. FIG. 6 is made up of three modules. Input information stored 

in a data bufifer 10 is simuhaneously directed to an image 

FIELD OF THE PRESENT INVENTION property classifying section 20, the first module, and an 

image processing section 30, the second module. The image 

The present invention relates generally to a system for property classifying section 20, is made up of any number of 

processing document images, and more particularly, to an submodules, (e.g. auto-correlator 21 and discriminator 22), 

improved method of image processing the document images which determine whether a block of image pixels stored in 

utilizing a fuzzy logic classification process. the data buffer 10 is one type of imagery or another, (e.g. 

uAnfruniTMn np twp pdc^pmt halftone, line/text, or cootone). In parallel with the image 

BACKGROUND OF THE PRESENT 15 property classifying section 20, the image processing section 

INVENTION 30 ^ made up of any number of sub-processing sections. 

In the reproduction of images from an original document (e g- high frequency halftone processor 31 low frequency 

or images from video image data, and more particularly, to halftone processor 32, line/text processor 33, or contone 

the rendering of image data representing an original docu- Processor 34) which perform image processing operations 

ment that has been electronically scanned, one is faced with " same block of image pixels as section 20. Each miage 

Umited reflectance domain resoluUon capabilities because f.^b-processmg section performs image processmg opera- 

, , . . . - • ■ , tions that are adapted to improve the image quahty of a 

most output devices are binary or require compression to ^^^^^^ ^ ^/^^^^^ module, control section 

binary for storage effiaency. This is particularly evident informafion derived from the image classifying 

when attemptmg to reproduce halftones, hnes, and contmu- ^^-^^ jO, to control the image processing section 30. In 

ous lone (contone) images. ^ther words, the control section 41 acts like a multiplexer 

An image data processing system may be tailored so as to and selects the proper processed image data according to the 

ofiket the limited reflectance domain resolution capabilities image classification determined by the image classifying 

of the rendering apparatus, but this tailoring is difficult due section 20. 

to the divergent processing needs required by different types The decision as to what class of imagery image data 

of images which may be encountered by the rendering bek)ngs to is typically binary in nature. For example, in. a 

device. In this respect, it should be understood that the image conventional image segmentation scheme image prope^Jj^ 

content of the original document may consist of multiple classifying section 20 classifies image data as one of three\ 

image types, including halftones of various frequencies, classes of imagery, (high frequency halftone, low frequency I 

continuous tones (con tones), line copy, error diffused halftone, or contone). Depending on those classification, J 

images, etc, or a combination of any of the above, and some image data is processed according to the properties of that 

unknown degree of some or all of the above or additional class of imagery is selected, (either low pass filter and 

image types, re -screening if it*s a high frequency halftone, threshold with 

In view of the situation, optimizing the image processing a random threshold if it is a low frequency halftone, etc.). 

system for one image type in an effort to offiset the limita- 40 Also, assuming that the decision as to which of the three 

tions in the resolution and the depth capability of the classes of imagery image data belongs is based on a single 

rendering apparatus may not be possible, requiring a com- image property, the peak count of the input image data, the 

promised choice which may not produce acceptable results. resulting image classification decision of the peak count 

Thus, for example, where one optimizes the system for low image properly is made by thresholding the peak count into 

frequency halftones, it is often at the expense of degraded 45 three classes of imagery. 

rendering of high frequency halftones, or of line copy, and Consequently, the control section 40 decides the type of 

visa versa. image processing the image data requires depending on the 

To address this particular situation, "prior art" devices decision made by the classification section 20. Thus, the 

have utilized automatic image segmentation to serve as a output of classification section 20 is quantized to one of 

tool to identify different image types or imagery. For 50 three possibilities. The control section 40 selects the output 

example, in one such system, image segmentation was from one of the three image sub-processing sections based 

addressed by applying a function to the video, the output of upon this classification, 

which was used to instruct the image processing system as Based on the nature of conventional image classification 

to the type of image data present so that it could be processed systems, the classifying information, gathered over a context 

appropriately. In particular, an auto -correlation function was 55 of many pixels, changes gradually. But in the process of 

applied to the stream of pixel data to detect the existence and comparing this classifying information with a classification 

estimate the frequency of halftone image data. Such a threshold one could create abrupt change in the classes. This 

method automatically processes a stream of image pixels abrupt decision making, which produces a forced choice 

representing unknown combinations of high and low fre- among several distinct alternative choices, is a primary 

quency halftones, contones, and/or lines. The auto- 60 reason for the formation of visible artifacts in the resulting 

correlation function was applied to the stream of image output image. Most transition points or thresholds are 

pixels, and for the portions of the stream that contain high selected so that an image can be classified as one class of 

frequency halftone image data, the function produced a large imagery with a high degree of certainty; however, those 

number of closely spaced peaks in the resultant signal. classes of imagery that cannot be classified with such 

In another auto-segmentation process, an auto-correlation 65 certainty have multiple transition points or a transition zone, 

function is calculated for the stream of halftone image data Using only one point to define a transition zone results in 

at selected time delays which are predicted to be indicative the formation of visible artifacts in the resulting output 
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image if the output image spans in the transition zone. Image processing section 30 includes units 31, 32, and 34 

Although it is possible to shift or make the transition zone that generate output data firom the image data in accordance 

narrower so that there is less chance that an image falls into with methods unique to each predetermined class of imag- 

the zone, there exists limitations on how narrow the zone can ery. Subsequently, mixer 41 combines a percentage of each 

be made. The narrowing of the transition zone is the 5 class of output image data from units 31, 32, and 34 

decreasing of noise and/or variation in the information used according to the ratio of the probabilities 27 determined by 

to classify so as to narrow the area over which classification classifier 25. The resulting output image daU for mixer 41 is 

is not "certain", resulting in less switching between classi- gjored in output buffer 50 before subsequent transmission to 

ncations. jmage output terminal such as a printer or display. 

Moreover, the classification of real images covers a i ,, . c ■ • 1 r 

continuum from weU below to well above thresholds '° . Inma^ly te stream of miage p«^^ 

between classificaUons. Tliis means that there are areas of an ("T} « fed to data buffer 10. The miage data stored 

image which are, for example, just above a threshold. "'"^^'f S^^V f°™*'> example, 6 to 8 bits 

Variations in the gathered (lowpass filtered) information due P^r P'xel- A suitable block size is 1 6 pixels at 400 spots per 

to "flaws" in the input video or ripple due to interactions ">ch, or 12 pixels at 300 spots per inch. Too large of a sample 

between the area of image being used for the classification " ^ize results in the inability to properly switch classification 

process and periodic structures in the input video results in "^"^ channels between fine structures in the image, or 

areas falling below the threshold. With discrete to switch soon enough when moving from one classification 

classification, this results in a drastically different to another. An example of this problem is small text forming 

classification, thereby resulting in artifacts in the rendered » ^ halftone image. Given a font size which is large 

^ggg 20 enough to read, a good layout practice of leaving white 

Tlius, it is desirable to classify image data in a fuzzy '^^'''^ ^ " ''^If » ^^"^^^ '^^ 

manner, slowly shding the classification from one classifi- """S^' block turns out to be a good 

cation to the other, reflecting the information that has been compromise with most documents. TTius. too large a sample 

„o*i,««^ A^.fo^tc tu^ «=.T,i*;«r, «„ri«r*.,i Jn^or,^ «»u ^izc rcsults m classification transitions at the edge oi objects 
gathered. Artitacts in the resulting rendered image will now , , l u-. l . i_- . i. 
be soft and follow the contours of the image, and so the " '° ^e larger than the wh.tespace between the objects, result- 
artifacts will not be objectionable "^S mappropnate classification and rendermg. 

In general, the "prior art" describes the control section 40 ^ith reference FIG. 3, the conventional probabilistic 

as essentiaUy having a switch. Since the image processing classifier 25 is shown in detail. The block of image pixels 

steps performed for each class of imagery are different ,„ stored in buffer 10 is transmitted to a characteristic calcu- 

depending on the classification given to each block of input 28 through data buffers 15. Calculator 28 provides an 

image pixels, the switch or multiplexer allows data residing 0"'P"' ^^lue that characterizes a property of the image data 

at the output of the image processor 30 to be directed to an transmitted from buffer 10, such as its peak count. In one 

output buffer 50 depending on the decisions made by the embodunent, a characteristic value is determined by calcu- 

imagery classifying section 20 which are received as signals '^'^^ 28 that represents the peak count of the block of image 

on Unes 23 and 24. This type of binary decision making is data. The peak count is determined by countmg those pixels 

rigid and results in image segmentation decisions that do not whose values are the non-trivial local area maximum or 

fail gracefuUy and consequently form visible artifacts in the minimum in the block of image data. First local area 

output image maximum or minimum pixel values are selected depending 

To address this forming of visible artifacts in the rendered ,o °° f^^^^^^ l^^'f P^^f ^ ^ ^'l^f 

output image, it has been proposed to utilize a pmbabilistic '"^^^^ ^/^^ u I 

segmentation process to aUow the image processing system °f '^^^^^ ^^^^ P*^^*' 

to fail more gracefully when incorrect segmentation deci- ^^^r calculator 28 evaluates the peak count of the unage 

sions are made. An example of such a probabilistic segmen- data, probabiUty classifier 29 determines three probability 

tation system is illustrated in FIG. 2. 45 values 27 that correspond to each image type associated with 

FIG. 2 shows a block diagram of a conventional image ^o^^it ^ expressed by the characteristic function 

processing system which incorporates a probabilistic clas- stored m memory 26. The characteristic function, deter- 

sification system. As illustrated in FIG. 2, the conventional "^^^^ ^P^^" ^^g^ ^^^^^ represents a plurahty of 

system receives input image data derived from any number probability distributions that are determined using a popu- 

of sources, including a raster input scanner, a graphics 50 "^^^es. Each probability distribution depicts the 

workstation, an electronic memory, or other storage probabUity that a block of image data is a certam type given 

elements, etc. In general, the image processing system occurrence of an image property, a peak count, 

shown in FIG. 2 includes probabilistic classifier 25, image For example, the characteristic function stored in memory 

processing section 30, an image processing and control 26 can be represented by the graph shown in FIG. 4, which 

mixer 41. 55 relates the probability distributions for a contone 1, low 

Input image data is made available to the image process- frequency halftone 2, and high frequency halftone 3 to the 

ing system along data bus 15, which is sequentially pro- occurrence of a particular image characteristic, which in this 

cessed in parallel by probabilistic classifier 25 and image example is a peak count. The characteristic function stored 

processing section 30. ProbabiUstic classifier 25 classifies in memory 26 can be adjusted using input control 18. Using 

the image data as a ratio of a number of predetermined 60 control 18, the resulting output image stored in buffer 50 can 

classes of imagery. The ratio is defined by a set of prob- altered by modifying the characteristic function repre- 

abihty values that predict the likelihood the image data is senting the different classes of imagery evaluated by the 

made up of a predetermined number of classes of imagery. image processing system 30. 

The probabilities 27, one for each predetermined class of Subsequently, probability classifier 29 determines each 

imagery, are input to the image processing mixer or control 65 probability value by evaluating the probability distribution 

unit 41 along with image output data from image processing of each image type represented by the characteristic function 

section 30. stored in memory 26. After determining the probability 
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values, classifier 29 outputs these results to image process- combined with an additional one-dimensional transfoim unit 

ing mixer or control 41 . to produce a plurality of distinct Iwo-dimensional filteis, any 

The image processing section of FIG. 2 operates concur- one of which is selectable on a pixel by pixel basis. 

renUy with the probabilistic classifier 25 on the image data Moreover, this proposed conventional system has the added 
stored in buffer 10. Image processing section 30 includes a 5 advantage of providmg two-dmiensional finite impulse 

high frequency halftone processing unit 31, a low frequency '«^P°'^ filters without etnploying muUiple idenucaUy con- 

. Tr. „ „ i-i r ^«t^«- x^<^oir,« siructed two-dimensional filters arranged in a parallel 

halftone processing unit 32, and a contone processing unit ^ substantially reducing the complexity and 

34. Each processmg unit processes aU unage data in accor- ^^^^ ^j^J ^^^^^ ^ ^^^^^^ understandkig of 

dance with a particular image type. Each of the processing conventional system, the conventional system %viU be 
units 31, 32, and 34 generates output blocks of unquantized lO described below 

video data. conventional system, as illustrated in FIG. 1, includes 
Image processing control 41 mixes the data output blocks (mage processing module 20 which generally receives offset 
to form a composite block of output image signals that is ^nd gain corrected video through input line 22. 
stored in output buffer 50. The manner in which the output Subsequently, the image processing module 20 processes the 
blocks are mixed is characterized by a ratio defined by the " ^^^^ ^^g^ ^j^ta according to control signals from CPU 24 
probability determined by the probabilistic classifier 25. (o produce the output video signals on line 26. As illustrated 
FIG. 5 shows the conventional image processing mixer 41 in FIG. 1, the image processing module 20 may include an 
in detail. Mixer 41 multiplies the output blocks with the optional segmentation block 30 which has an associated line 
probability, using multipliers 42, 43, 44. The resulting buffer 32, two-dimensional filters 34, and an one- 
output from each multiplier is representative of a percentage dimensional rendering block 36. Also included in image 
or ratio of each output block, the sum of which defines a processing module 20 is line buffer memory 38 for storing 
composite block of output image signals. The composite the context of incoming scanlines. 

block of output image signals is formed by adding the output Segmentation block 30, in conjunction with the associated 
of the multipliers using adder 45 and by subsequently scanline buffer 32, automatically determines those areas of 
quantizing the sum of adder 45 using quantizer 47. The ijje image which are representative of halftone input region, 
resulting image block output by quantizer 47 is stored in Output fi-om the segmentation block, (video class), is used to 
output buffer 50 before subsequent transmission for output implement subsequent image processing effects in accor- 
to an image output terminal having limited resolution or ^ance with a type or class of video signals identified by the 
^cp*h. segmentation block. For example, the segmentation block 
The above-described image classification system utilizes may identify a region containing data representative of an 
a probabilistic approach to classify the image data. Such an input high frequency halftone image, in which case a low- 
approach presents problems in that the classification of the pass filter would be used to remove screen patterns, 
image data is mutually exclusive, the image data is classified otherwise, a remaining text portion of the input video image 
as a particular type in absolute terms even though the may be processed with an edge enhancement filter to 
probability of the decision being correct is just over 50%. improve fine line and character reproduction when thresh- 
This results in difficulties in trying to design an image olded. 

processing system which will process the image data with- Two-dimensional filter block 34 is intended to process the 

out visible artifacts in the rendered image when the decision incoming, corrected video in accordance with the predeter- 

on the image type does not have a high confidence. mined filtering selection. Prior to establishment of the 

Not only is image classification important to a digital required scanfine content, the input video bypasses the filter 
reprographic system, rendering based on this classification is by using a bypass channel within the two-dimensional filter 
important. One such component of the rendering system is hardware. This bypass is necessary to avoid delirious effects 
digital filtering. The digital filtering process should be both to the video stream that may result from filtering of the input 
efiScient and low cost. Moreover, the filter design should 45 video prior to establishing the proper context, 
have some non-separable and/or time-varying characteris- Subsequent to two-dimensional filtering, the one- 
tics so that the fiher can be used in a fuzzy segmentation dimensional rendering block is used to alter the filtered, or 
system However, trying to achieve one goal or another can possibly unfiltered, video data in accordance with selected 
adversely impact the other goal. Various approaches have one -dimensional video effects. One-dimensional video 
been devised for the implementation of digital filtering 50 effects include, for example, thresholding, screening, 
techniques which try to solve minimize the adverse impacts. inversion, tonal reproduction curve (TRC), pixel masking. 
These techniques will be discussed briefly below. one-dimensional scaling, and other effects which may be 

In a "prior art" digital filtering technique, a two- applied one-dimensionally to the steam of video signals. As 

dimensional finite impulse response filter having a plurality in the two-dimensional filter, the one-dimensional rendering 
of filter portions of essentially identical construction are 55 blocks also includes a bypass channel where no additional 

arranged in a parallel configuration. A de-multiplexer sepa- effects would be applied to the video, thereby enabling the 

rates an input data signal comprising consecutive digital received video to be passed through as an output video, 

words and supplies each digital word in sequence to a Therefore, it is desirable to implement an image classi- 

separate filter portion. Subsequently, a multiplexer, coupled fication system which provides a truer classification of the 
to the output of the filter portions, selectively outputs the 60 image type and the image types are not necessarily mutually 

filtered data from each filter portion in a sequence corre- exclusive. Such a system would incorporate fuzzy logic, 

sponding to the order of separation of the input data, thereby thereby allowing image data to be classified as being a 

resulting in a filtered version of the original input data. member of more than one image class. This feature is critical 

The system described above all has the limitation with in areas where the image goes from one image type to 
respect to either speed or high cost. In view of these 65 another. Moreover, it is desirable to implement an image 

limitations, it has been proposed to provide a plurality of processing and rendering system which takes advantage of 

one-dimensional transform units that may be selectively the fuzzy classification system. 
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SUMMARY OF THE PRESENT INVENTION being presented for illustrative purposes only and should not 

r\ ™ # f .u * * *• • J f be limitative of the scope of the present invention, wherein: 

One aspect or the present invention is a method for ^ ^ 

determining a continuous tone classification type of a pixel FIG. 1 is a schematic illustration of a conventional unage 

of image data in a window of pixels. The method classifies processing hardware module incorporating a two- 

the pixel as representing a continuous tone; detects a gra- ^ dimensional filter, 

diem in the window of pixels of which the pixel is a member; FIG. 2 is a block diagram illustrating a conventional 

and classifies the pixel as a rough continuous tone when said image processing system incorporating probabilistic seg- 

step (b) detects a gradient indicating that image data within mentation; 

the window of pixels is changing at a rate greater than a FIG. 3 is a block diagram detailing the probabilistic 

predetermined threshold. segmentor shown in FIG. 2; 

A second aspect of the present invention is a system for FIG. 4 shows an example of a characteristic function of 

determining a continuous tone classification type of a pixel the image property, peak count; 

of image data in a window of pixels. The system includes an piG. 5 is a block diagram illustrating in detail the image 
auto-segmentation circuit to enable classifying a pixel as a processing mixer shown in FIG. 2; 
continuous tone and means for detecting a gradient in the pjQ 5 ^ ^lock diagram Ulustrating a "prior art" image 
window of pixels of which the pixel is a member. The processing system- 
auto-segmentation circuit classifies the pixel as a rough r-,^ _ . .1 '1 r j- 1 cu • 

, . j-. j-.' .u.- J. FIG. 7 IS a block diagram of a digital filtering system 

contmuous tone when a gradient indicatmg that image data . *■ .u a 1 c f*u * 

.... . , r • 1 - i_ . . f mcorporatmg the fuzzy classification process of the present 

withm the window of pixels is changmg at a rate greater than ^„ • ^. 

J , • J .u u 1 J • J * f J 20 mvenlion; 
a predetermmed threshold is detected. 

. . , . r *u ^ • • J • FIG. 8 IS a block diagram illustrating a screemng system 

A third aspect of the present mvention is a rendermg * *ua i t 

c ^ If J*' -J c mcorporatmg the fuzzy classification process of the present 

system for processme a pixel of image data in a window of . ^. ^ 

o o mvention' 

pixels based on a continuous tone classification type. The ^ ^ J . . , . .„ . , . 

system includes an auto-segmentation circuit to enable clas- P^^. 9 is a block diagram illustraUng a more detailed 

sifying a pixel as a continuous tone; means for detecting a ^^^^^^ ^y^^^"^ illustrated m FIG. 8; 

gradient in the window of pixels of which the pixel is a FIG. 10 is a block diagram illustrating a scalable unage 

member; and a filter to enhance the image data of the pixel classification system archilecmre according to the concepts 

tone when a gradient indicating that image data within the of present invention; 

window of pixels is changing at a rate greater than a 3Q FIG. 11 is a block diagram illustrating another scalable 

predetermined threshold is detected. image classification system architecture according to the 

A fourth aspect of the present invention is a method for concepts of the present invention; 

determining a continuous tone classification type of a pixel FIG. 12 is a block diagram illustrating a third scalable 

of image data in a window of pixels. The method classifies image classification system architecture according to the 

the pixel as representing a continuous tone; detects a gra- 35 concepts of the present invention; 

dient in the window of pixels of which the pixel is a member; FIG. 13 is a graphical representation of an enlarged view 

and cla.ssifies the pixel as a smooth continuous tone when of a half toned area; 

said step (b) detects a gradient indicating that image data FIG. 14 is a graphical representation of detected peaks for 

within the window of pixels is changing at a rate less than the graphical representation of FIG. 13; 

a predetermined threshold. 40 FIG. 15 is a graphical representation of an enlarged view 

A fifth aspect of the present invention is a system for of a ladder chart; 

determining a continuous tone classification type of a pixel fiQ Uissi graphical representation of false detection of 

of image data in a window of pixels. The system includes an halftone peaks for the graphical representation of FIG. 15; 

auto-segmentation circuit to enable classifying a pixel as a ^ ^ ^ graphical representation of a reduction in the 

continuous tone and means for detecting a gradient in the 45 f^lse detection of halftone peaks for the graphical represen- 

window of pixels of which the pixel is a member. The ^^tion of HG. 15 according to the concepts of the present 

auto-segmentation circuit classifies the pixel as a smooth invention* 

coDtinuous tone when a gradient indicating that image data ^ ^^j^^j representation of an enlarged view 

withm the window of pixels is changing at a rate less than ^ y^^-^ area- 

a predetermined threshold is detected. 50 „„ ' ... ... , .. „ 

. ... , . FIG. 19 IS a graphical representation of false detection of 

A sixth aspect of the present invention is a rendering halftone peaks for the graphical representation of FIG. 18; 

system for processing a pixel of miage data m a wmdow of pj^, is a graphical representation of a reduction in the 

pixels based on a contmuous tone classification type. Tlie ^^^^^.^^ ^^^^^^ ^^^^ ^ ^ ^^^^ 

system includes an auto-segmentation circuit to enable clas- j^jj^^ „f jg ^^^^j ,^ ^ 

sifying a pixel as a continuous tone; means for detectmg a 55 ^^^j^^^jj. 

gradient in the window of pixels of which the pixel is a ' . ... ■ ^ 

member; and a filler to lowpass the image data of the pixel , ^1 ? » representation of a ajmparison 

tone when a gradient indicating that image data within the between a frequency response of a triangular filter and a 

window of pixels is changing at a rale le« than a predeler- ^«=°f'^"'g «=°°'=''P'« °^ P^^*"' 

mined threshold is detected. ,1 

Further objects and advantages of the present invention a block diagram UlustraUng an architecture for 

will become apparent from the foUowing descriptions of the ^ parallel two-dmiensional blur filter according to the con- 

r. f.u *• cepts of the present mvention; 

various teatures or the present invention. *^ 

RG. 23 is a circuit diagram of a filler circuit according to 

BRIEF DESCRIPTION OF THE DRAWINGS ^5 the concepts of the present invention; and 

The following is a brief description of each drawing used FIG. 24 is a block diagram illustrating an architecture for 

in describing the present invention, thus, the drawings are a two-dimensional blur filter. 
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DETAILED DESCRIPTION OF THE PRESENT erties of the physical image to an electronic or electrical 

INVENTION representation. Id other words, in this situation, a pixel is an 

^ ^ „ .„ , J J J . . r ^ ^ electrical, (or optical), representation of the physical optical 

Tlie foUowmg>^l be a detailed descnptiono^^ .^^ ^ -^^1 ^ ^^^^^ ^ bysical 

ings Illustrated in the present invention. In tlus descripuon, ^ ^^g^^^^j^ ^„ ^ ^^^^j » ^be 

the terms "image data or 'pixels in the form of video environment, may refer to an electrical, (or 

miage signals, which may be either analog or digital voltage ^^^j^ ^^^^ ^ ^^^^ 3^ ^^-^^ represents the 

representations of an image, indicate a representation of an ^^^^^^^ ^ ^^^^ ^ recording substrate that can be 

miage provided from a suitable source. For example, the j^^dered. In other words, in this situation, a pixel is an 

unage signals may be obtained through kne by hnescanmng electrical, (or optical), representation of the physical spot 

of an image bearmg the ongmal by one or more photosen- ^^^^^ ^ ^ j^^^ ^ ^ j^^^ ^^^^ 

sitive elements, such as a multiple photosite array of charge ^^^^^ ^ ^^^^^^ rendered ink droplet, 

couple devices commonly referred to as CCDs. Line by line r .1. a ♦ j 

^ . f . . ■ I r A, *■ F Many of the documents produced today are compound 

scanning of an image beanng the original for the duration of . . • . j . _. r 1 

f , . 11 1 ,„„ ° . j„ f „ „f documents m that the documents are composed of several 

imaee data is well known and does not form a part of the . . • . r j-nr . • 

^ . . 15 different sub-images that are of different image types or 

present invention. . c r .u * 7 / u * 

^ image classes. Some of the common types are text, photos 

Image data may also be denved by a computer worksta- (contones), and halftones. One reason for the increased 

tion program in accordance with document creation appli- appearance of compound documents is the widespread use 

cation software or from a data storage device. In content, the commercially available word processing and desktop 
onginal video unage signals may be composed entirely of a 20 pubUshing software that is able to generate them, 

single image component such as lines, text, low frequency ^ ^ ^^^^^^^ ^ 

halftones, bghfr^ processing in order to provide optimal image 

quality. Conventionally, to automatically choose the best 

The following description also includes references to processing for different areas of an image, each area is 
slowscan and fastscan digital image data when discussing 25 classified into one of several pre-defined classes to deter- 

the directionahty of two-dimensional filtering architecture. njine how to render that part of the image. This image type 

For purposes of clarification, fastscan data is intended to ^^^^^ ^y^^ information can then be used to determine the 

refer to individual pixels located in a succession along a appropriate processing required to obtain a good rendition of 

raster of image information, while slowscan data refers to ^^e image when printing, to choose a method of image 
data derived from a common raster position across multiple 30 compression, to determine if optical character recognition 

rasters or scanlines, ^oulj 5^ useftil, etc. 

As an example, slowscan data would be used to describe However, as noted previously, the classification process 

signals captured from a plurality of elements along a linear should not be so crisp so as to avoid problems when the input 

photosensitive array as array is moved relative to a docu- image is not very similar to any of the classes, or the input 
ment. On the other hand, fastscan data would refer to the 35 ^^^^^^ properties straddle the border between two classes 

sequential signals coUecled along the length of the linear ^ ^ particular action is taken based upon a 

photosensitive array durmg a single exposure period which ^^^^ ^j^^ identification because the classes are mutually 

is also commonly referred to as a raster of data. exclusive, it may create undesirable results for a non- 

Moreover, in describing the present invention, it is prototype image. This is seen when rendering images for 

assumed that the video signal has a value in a range between printing on a xerographic printer. The classification of the 

0 and 255. However, any range from the video signal can be image can cause output artifacts such as when a hafftone 

utilized in conjunction with the present invention. image is classified as a contone image. 

Furthermore, in the foUowing description, the term "grey Another type of problem is that adjacent areas of the 

level" will be used to describe both black and white and image may be classified differently due to smaU variations in 

color applications. the image. This is caUed class switching. If this information 

Furthermore, in describing the present invention, the term is used for image enhancement and printing, the output may 

"pixel" will be utilized. This term may refer to an electrical, have objectionable artifacts due to local variations, 

(or optical, if fiber optics are used), signal which represents Examples of these objectionable artifacts are grainy image 

the physical measurable optical properties at a physical outputs. 

definable area on a receiving medium. The receiving Jq eliminate the above described problems, an image 
medium can be any tangible document, photoreceptor, or classification system which utilizes a fuzzy membership into 
marking material transfer medium. each category or class can be used. In other words, the 
Moreover, the term "pixel" may refer to an electrical, (or classes in a fuzzy classification system are not mutually 
optical, if fiber optics are used), signal which represents the 55 exclusive, thereby eliminating problems with class switch- 
physically measurable optical properties at a physically ing and also allowing those areas to have processing differ- 
dcfinable area on the display medium A plurality of the ent than that of any of the other pre-defined classes; i.e., the 
physically definable areas for both situations represent the output can choose between a continuum of possible image 
physically measurable optical properties of an entire pbysi- processing techniques. 

cal image to be rendered by either a material marking gg in standard classification techniques, each area has one 

device, electrically or magnetic marking device, or optical class assigned to it. In the fuzzy implementation of the 

display device. » present invention, each area has a classification vector 

Lastly, the term "pixel," in the scanning environment, assigned to it. Every element of the classification vector has 

may refer to an electrical, (or optical, if fiber optics are a membership value associated with each of the pre-defined 
used), signal which represents physical optical property data 65 prototype classes. 

generated from a signal photosensor cell when scanning a Similar to the creation of crisp classifiers, a set of heuristic 

physical image so as to convert the physical optical prop- rules are used to determine the form of the classifier. The 
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following is an example of how heuristic rules are used to been setup with a third "other" class, at least one of the 

create a fiizzy classifier, according to the concepts of the elements of the vector must be greater than or equal to 0.5 

present invention. and less than or equal to 1. 

For illustrative purposes, an example of a two class Using the two class example above, the fuzzy classifica- 
non-fuzzy system is discussed. In this example, the system 5 tion rules would be the following. If the summation of the 

only classifies a particular region as either contone (i.e., grey squares of the Laplacian at every pixel in the subblock is 

pictorial) or text. An image may be considered text if there greater than a predetermined summation threshold and the 

are a lot of edges and most pixels are black or white. If this sum of the percentage of pixels with grey value less than a 

is not true, the picture is considered contone. black threshold value and the percentage of pixels with grey 

In order to determine edges, a variable relating to the value greater than a white threshold value is greater than a 

Laplacian of the image data at every point (pixel) is used. A predetermined bi-modal threshold, the pixel would be 

typical implementation for this type of segmentation may be assigned a membership value for the "text" class which is 

if the summation of the squares of the Laplacian at every the minimal value associated with each of the conditional 

pixel in the subblock is greater than a predetermined sum- statements. 

mation threshold and the sum of the percentage of pixels To better understand this concept, the following brief 
with grey value less than a black threshold value and the explanation of fuzzy logic will be provided. In fuzzy logic, 
percentage of pixels with grey value greater than a white unlike Boolean logic, the results of the conditional state- 
threshold value is greater than a predetermined bi-modal ments do not generate either absolute true or absolute false, 
threshold, the image is text, else the image is contone. but a value corresponding to the amount of the resulting 

In this example, since the parameters are device statement which is true. This result is due to the fact that the 

dependent, tests, which are known to those skilled in the art, conditional statements are also not absolute, 

would be run to determine the values of all of the param- For example, in the above described rule, from testing, it 

eters; the percentage of pixels with grey value less than a may be determined that the midpoint (predetermined target 

black threshold value, the percentage of pixels with grey conduion value) of the fuzzy Laplacian summation condi- 

value greater than a white threshold value, the summation tion should be 50. The midpoint represents maximum uncer- 

threshold, and the bi-modal threshold before executing the tainty as to whether the value 50, in this example, is a 

segmentation routine. Note that only one class can be member of the class Large Laplacian. Moreover, from 

chosen, either text or contone. testing, it is determined that, with absolute certainty, a pixel 

In order to implement a fuzzy classifier, according to the is member of the Large Laplacian (membership equals LO) 

concepts of the present invention, several modifications if the summation is equal to or greater than 75 

must be made to the above described heuristic rules. As (predetermined absolute condition value) and it is deter- 

described above, there exists a single rule defining only text. mined that, with absolute certainty, a pixel is not a member 

If the condition for text is "not true," contone is chosen. In of the Large Laplacian (membership equals 0.0) if the 
the fuzzy system of the present invention, contone must have ^5 summation is equal to or less than 25 (predetermined 

it's own rule, since the text membership rule is not an absolute condition value). Fuzzy logic allows the classifier 

absolute tmth, but a relative truth that the classification is to assign 0.5 (conditional value) to a result where the 

true. summation is 50 and linearly extrapolate to the assigned 

Moreover, even providing a rule for contones will not values (conditional values) to 1 and 0.0 for the values 75 and 
satisfy the excluded middle law; therefore, a third "other" 40 25, respectively; i.e.. value 55 would be assigned a mem- 
class must be added to satisfy the constraints of fuzzy logic. bership value of 0.6. Note that these values are device 
Without the "other" class, it would be possible to have dependent, and thus, the midpoint and the range needs to be 
membership of the image in all classes be very small. Thus, determined for each individual device, 
the "other" class creates a lower bound of a half (0.5) for the Furthermore, from testing, it may be determined that the 
minimum membership in any given class. A minimum 45 midpoint of the class bi-modal should be 80; with absolute 
magnitude for the maximum membership assures that all certainty, a pixel is in the membership if the percentage sum 
actions/decisions made using the relative membership val- is equal to or greater than 90; and, with absolute certainty, 
ues are not extremely sensitive to the class memberships, a pixel is not in the membership if the percentage sum is is 
which they would be if membership in all classes was small, equal to or less than 70. Fuzzy logic allows the classifier to 
thereby making the fuzzy classification more robust. 5Q assign 0,5 to a resuh where the sum value is 80 and Unearly 

In the fuzzy classification scheme of the present extrapolate to the assigned values to 1 and 0.0 for the values 

invention, the pixel or unit of image data has a membership 90 and 70, respectively; i.e., value 85 would be assigned a 

in each of the three classes; text, image and "other." In other membership value of 0.75. Note that these values are device 

words, the pixel is no longer considered to be an element of dependent, and thus, the midpoint and the range needs to be 
just one of the mutually exclusive classes. However, if the 55 determined for each individual device, 

determination for one class reach absolute certainty; i.e. the To further explain the fuzzy technique, it is assumed that 

membership in a single class is 1 and the other classes is the membership values for each conditional statement are 

zero; the fuzzy system does generate values which would 0.5 and 033, respectively. In this scenario, the membership 

represent a crisp system. value for the pixel for the class text would be 0.33 because 

In view of this non-exclusivity characteristic of the pixel 60 fuzzy logic treats "ANDed" statements as determining the 

image membership, the membership of the pixel is repre- minimal value for all the conditions and assigning the 

sented by a membership vector, V,-, whose entries corre- minimal value to the membership value, 

spond to the membership of the pixel (image element) in Using the contone rule of if the summation of the squares 

each of the classes. Note, typically, there are no constraints of the Laplacian at every pixel in the subblock is less than 
on this vector other than all of it's elements must be greater 6S a predetermined summation threshold and the sum of the 

than or equal to 0 and less than or equal to 1. However, since percentage of pixels with grey value less than a black 

the fuzzy classification rules of the present invention have threshold value and the percentage of pixels with grey value 
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greater than a white threshold value is less than a predeter- 
mined bi-modal threshold, the image is "contone," each 
conditional statement will be discussed in fuzzy logic terms. 

For example, in the above described rule, from testing, it 
may be determined that the midpoint of the fuzzy Laplacian 5 
summation condition should be 50. Moreover, from testing, 
it is determined that, with absolute certainty, a pixel is in the 
membership if the summation is equal to or less than 25 and 
it is determined that, with absolute certainty, a pixel is not in 
the membership if the summation is equal to or greater than ^0 
75. Fuzzy logic aUows the classifier to assign 0.5 to a result 
where the summation is 50 and linearly extrapolate to the 
assigned values to 1 and 0.0 for the values 25 and 75, 
respectively; i.e., value 55 would be assigned a membership 
value of 0.4. 

Furthermore, from testing, it may be determined that the 
midpoint of the fijzzy bi-modal condition should be 80; with 
absolute certainty, a pixel is in the membership if the sum is 
equal to or less than 70; and, with absolute certainty, a pixel 
is not in the membership if the sum is equal to greater 90. 
Fuzzy logic allows the classifier to assign 0.5 to a result 
where the percentage value is 80 and linearly extrapolate to 
the assigned values to 1 and 0.0 for the values 70 and 90, 
respectively; i.e., value 85 would be assigned a membership 
value of 0.25. 

To further explain the fuzzy technique, it is assumed that 
the membership values for each conditional statement are 
0.75 and 0.8, respectively. In this scenario, the membership 
value for the pixel for the class text would be 0.75 because 
fuzzy logic treats "ANDed" statements as determining the 
minimal value for all the conditions and assigning the 
minimal value to the membership value. 

Lastly, the fuzzy rules states that if image is neither "text" 
or "contone," the image is "other." This last rule, which 35 
defines the "other" class, can be represented mathematically 
as jU^,A.r(image)=min(l-/i,^^Xiinage), 1-// 

contone 

(image)) 

where fijy) is the membership of Y in the class X. Note that 
if /*,^Xif^age), and //^^^^(image) are smaller than 0.5 then 
Z'or/i^Xiniage) will be greater than 0.5 (as stated earlier). In 40 
the example, given above, /i,^_^Xiraage) is equal to 0.33 and 
i«ca«fanXimage) IS equal to 0.75, thus,^^;,^Xiaiage) would be 
equal to 0.25, with the resulting membership vector being 
[0.33 0.75 0.25]. Note the element values of the vector need 
not add up to 1. 45 

The predicate of each the rules described above is 
extended to a fuzzy truth instead of an absolute truth to 
provide the element value for the membership vector. Thus, 
in order to make the inequality "Y is >X" a fuzzy truth, a 
membership function is defined for ">X". Similarly, a fuzzy 50 
membership rule can be defined for <X (very often, the 
membership in (<X) is equal to not (>X): (1 -membership of 
(>X)). 

For simplicity in implementation, the membership in (>X) 
is defined as follows: 55 



A>jrO0-(l,Y>X+AX, 
0,Y<=X-AX, 

(Y-X+AX)/(2AX)^-AX<Y<-X+AX) 



60 



The value of AX determines the level of fuzzification of 
the class; if AX is extremely small, then the definition 
reduces to the crisp definition of greater than. It is further 
noted that although the fuzzification has been described as a 65 
linear relationship, the function describing the values 
between the end points and the mid point may be any type 



of function. Moreover, the midpoint could represent abso- 
lute certainty in the class and have a membership value of 1 
and the endpoints represent absolute certainty of non- 
membership such that the membership values would graphi- 
cally form a triangle with the midpoint being the peak. 

Reluming to the multiple "If s" in the above rules, the 
membership of image in the class text is equal to the fiizzy 
value of the predicate, ^,^^Ximage)«min(//,5/^ Threshold 
{S\f\ fi^sunod.1 77,n.^/White+Black)). 

To expand the concepts of the present invention to the 
processing of images on a typical xerographic laser printer 
requires separating images into several classes; for example, 
white, black, edge, pictorial, low frequency halftone, mid 
frequency halftone, high frequency halftone and other, etc. 
The classes white, black, and pictorial are subclasses of the 
set "contone" and low fi-equency halftone, mid frequency 
halftone, high frequency halftone are subclasses of the set 
"halftone." 

In a preferred embodiment of the present invention, the 
deterministic values for determining membership are as 
follows: 

BLACK_%=the percentage of pixels with grey value less 

than a black threshold value; 
WHITE_.%=the percentage of pixels with grey value 

greater than a white threshold value; 
Sij=Sum of the absolute values of the Laplacians in a 

window around the pixel being classified; 
Range ==Max grey Level-Min grey level inside a window 

around the pixel being classified; and 
Freq=Measurement of local 2-D frequency around the 

pixel being classified, 
To determine the membership value in a particular class, 
these values are compared to a variety of predetermined 
thresholds in a similar manner as described above with 
respect to the three class system. The various classes in this 
preferred embodiment are demonstrated by the following 
rules: 

If (Sij is >SIJ__HALFTONE and RANGE is >RANGE_ 

HALFTONE and FREQ is >FREQ_HALFTONE), 

then CI is HALFTONE; 
If (Sij is >SIJ__EDGE and RANGE is >RANGE„EDGE 

and FREQ is <FREQ_HALFTONE), then pixel is 

EDGE; 

If (Sij is <SIJ_HALFTONE and FREQ is <FREQ_„ 

HALFTONE), then CI is CONTONE; 
If (CI is CONTONE and BLACK_% is >BLACK_ 

THRESHOLD), then pixel is BLACK; 
If (CI is CONTONE and WHITE_% is >WHITE_ 

THRESHOLD), then pixel is WHITE; 
If (CI is CONTONE and BLACK_% is <BLACK_ 

THRESHOLD and WH1TE__% is >WH1TE„ 

THRESHOLD), then pixel is PICTORIAL; 
If (CI is HALFTONE and FREQ is <LOW_FREQ), then 

pixel is LOW_FREQ_HALFTONE; 
If (CI is HALFTONE and FREQ is >LOW_FREQ and 

FREQ is <HIGH_FREQ), then pixel IS MID_ 

FREQ_HALFTONE; and 
If (CI is HALFTONE and FREQ is >HIGH_FREO), 

then pixel is HIGH_FREQ_HALFTONE; and 
If (pixel is not BLACK and pixel is not WHITE and pixel 

is not PICTORLAL and pixel is not EDGE and pixel is 

not LOW_FREQ_HALFrONE and pixel is NOT 

MID__FREQ_HALF1^0NE and pixel is not HIGH_ 

FREQ_HALFTONE), then pixel is OTHER. 
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The predicate of each the rules described above is 
extended to a fuzzy truth instead of an absolute truth to 
provide the element value for the membership vector. Thus, 
in order to make the inequality "Y is >X" a fuzzy truth, a 
membership function is defined for ">X", Similarly, a fuzzy 5 
membership rule can be defined for <X (very often, the 
membership in (<X) is equal to not (>X): (1 -membership of 
(>X)). 

For simplicity in implementation, the membership in (>X) 
is again defined as follows: lO 

a>aO0-Ci.y>x+ax, 



0,Y<=X-AX, 

(Y-X+AX)/(2AX)P(-AX<Y<»X+AX) 



15 



The value of AX determines the level of fuzzification of 
the class; if AX is extremely small, then the definition 
reduces to the crisp definition of greater than. It is further 
noted that although the fuzzification has been described as a 20 
linear relationship, the fimelion describing the values 
between the end points and the mid point may be any type 
of function. Moreover, the midpoint could represent abso- 
lute certainty in the class and have a membership value of 1 
and the endpoints represent absolute certainty of non- 25 
membership such that the membership values would graphi- 
cally form a triangle with the midpoint being the peak. This 
type of class could be used for a membership function of the 
class "oX". 

Returning to the multiple "IPs" in the above rules, the 30 
membership value of image in the class "edge" would be 
equal to the fuzzy value of the predicate, jW^(image)«min 

(j^>Sip Threshold ff^P)* f^>Range ThreshotJM^^grvy~^^^grey)-» 

^<Fr^q Thr^shatA'^^ Frcq)); the membership value of image in 
the class "black" would be equal to the fuzzy value of the 35 
predicate, /ifc/^^^(image)-min(/<^^/^ Threshold (21p), 
Tk^^p-^ Freq)./^5/«c* Threshoil^o of bUck pixels)); the 
membership value of image in the class "white" would be 
equal to the ftizzy value of the predicate, ^^.^(image)«min 

{l^<Slp Threshold (^^P)* M<Freq ThreshaldC^^ Ff^q), 40 

^<whue Threshaidi^ of white pixcls)); the membership value 
of image in the class "pictorial" would be equal to the fuzzy 
value of the predicate, //p,.^,^„.^X"Dage)=minOu^,^ 

Threshold 

(Hp), M<rreq Thr.sholJ2^ FrCq), ^^j^^^ ThresholJ.% of black 

pixels), ju^whue ThreshoiJ^^ of ^Mte pixels)); the membership 45 
value of image in the class "low frequency halftone" would 
be equal to the fuzzy value of the predicate, //^^^^image)« 
Threshold (21p) h>Rnnge 77.^.Ac./d(Max -Min^^^), 
^.F^j^^^S^^ Freq),/f^„^^„^ rhreshoicP-^ F^q)); the mem- 
bership value of image in the class "mid frequency halftone" 50 
would be equal to the fuzzy value of the predicate, ju^^^^ 
(image)=minOM^5,^ 

Threshold (^IP). h>Range Thresholc^^^gny' 
Min ), ^^f^^ ThresholJ?^ F^eq), ^^j^^^preq ThresholA'^^ 

Freq), ThreshoiA^^ Freq)); and the membership 

value of image in the class "high frequency halftone" would 55 
be equal to the fuzzy value of the predicate, /i^^^image)» 

Threshold (?^P)f^>Rartse Threshol/M^^g^y'^'^grey)y 
f^>Freq Thr^sholA^^ FrCq), ^^HighFreq ThreshalA^^ Freq)). 

To implement the fuzzy segmentation process for the two 
image class situation, image data received from the image 60 
source is divided into blocks of pixels. The fuzzy image 
classifier then, to determine a membership value for a 
particular image type, calculates the summation of the 
squares of the Laplacians in a window around the pixel 
being classified. Moreover, the fuzzy classifier calculates the 65 
percentage of pixels that have a grey value less than a 
predetermined black value and determines the percentage of 



pixels within the block which have a grey value greater than 
a predetermined white value. 

After calculating this information, the fuzzy classifier 
determines the conditional value for the condition relating to 
the summation of the squares of the Laplacian of every pixel 
in the block being greater than a Laplacian threshold and the 
conditional value for the condition relating to the sum of the 
percentage of pixels with a grey value greater than the 
predetermined black value and the percentage of the pixels 
with a grey value greater than the predetermined white value 
is greater than a bi-modal threshold value. Upon determin- 
ing these two conditional values, the fuzzy classifier deter- 
mines the minimal value and assigns this value as the 
membership value for the pixel in the "text" class. 

The fuzzy classifier then determines the conditional value 
for the condition relating to the summation of the squares of 
the Laplacian of every pixel in the block is less than a 
Laplacian threshold value and the conditional value for the 
condition relating to the summation of the percentage of the 
pixels with a grey value less than a predetermined black 
value and the percentage of the pixels with a grey value 
greater than the predetermined white value is less the 
bi-modal threshold value. Upon determining these two con- 
ditional values, the fuzzy classifier determines the minimal 
value and assigns this value as the membership value for the 
pixel in the "contone" class. 

The fuzzy classifier thereafter determines the membership 
value for the pixel in the "other" class by determining the 
minimal value between 1-the "text" membership value and 
1-the "contone" membership value and assigns this value as 
the membership value for the pixel in the "other" class. 

In this process, the pixels of image data received by the 
fuzzy classifier are assigned membership values for the three 
possible fuzzy classifications, "text," "contone," or "other." 
As noted above, the utilization of the "other" class is 
necessary in order to avoid having membership of the image 
in all classes to be very small. 

An example of implementing fuzzy classification for a 
laser xerographic printer, will now be discussed, more 
specifically, a process, carried out by a fuzzy classifier, to 
assign membership values to the pixels of image data for 
eight possible types or classes. 

As with the process described above, the process begins 
by dividing the pixels of image data into blocks of pixels. 
Thereafter, each block of pixels is analyzed to determine the 
sum of the absolute values of the Laplacians in a predeter- 
mined window around the pixel being presently analyzed; to 
determine a range value which is equal to the maximum grey 
level minus the minimum grey level inside the predeter- 
mined window around the pixel being presently analyzed; to 
determine a firequency value which is equal to the measure- 
ment of the local two-dimensional frequency around the 
pixel being presently analyzed; to determine a black value 
which is equal to the percentage of pixels that have a grey 
value less than a predetermined black value; and to deter- 
mine a white value which is equal to the percentage of pixels 
within the window having a grey value greater than a 
predetermined white value. 

Once these various values are determined, the fuzzy 
classifier begins the assigning of the membership values. 
The fuzzy classifier determines the conditional value for the 
condition relating to the sum of the absolute values of the 
Laplacian in the predetermined window around the pixel 
being presently analyzed being greater than a halftone 
threshold, the conditional value for the condition relating to 
the range value being greater than a range halftone 
threshold, and the conditional value for the condition relat- 
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ing to the frequency value being greater than a frequency 
halftone threshold. Upon determining these three condi- 
tional values, the fiizzy classifier determines the minimal 
value and assigns this value as the membership value for the 
pixel in the "halftone" class. 5 

Then, the fuzzy classifier determines the conditional value 
for the condition relating to the sunmation of the absolute 
values of the Laplacians in the predetermined window 
around the pixel being presently analyzed being greater than 
an edge threshold, the conditional value for the condition 
relating to the range value being greater than a range edge 
threshold, and the conditional value for the condition relat- 
ing to the frequency value being less than the frequency 
halftone threshold. Upon determining these three condi- 
tional values, the fuzzy classifier determines the minimal 
value and assigns this value as the membership value for the 
pixel in the "edge" class. 

The fiizzy classifier thereafter determines the conditional 
value for the condition relating to the sum of the absolute 
values of the Laplacians in the predetermined window 
around the pixel being presently analyzed being less than the 20 
halftone threshold and the conditional value for the condi- 
tion relating to the frequency value being less than the 
frequency halftone threshold. Upon determining these two 
conditional values, the fuzzy classifier determines the mini- 
mal value and assigns this value as the membership value for 25 
the pixel in the "conlone" class. 

Tiie fuzzy classifier then determines the conditional value 
for the condition relating to the pixel being a contone image 
and the conditional value for the condition relating to the 
black value being greater than a black threshold value. Upon 30 
determining these two conditional values, the fuzzy classi- 
fier determines the minimal value and assigns this value as 
the membership value for the pixel in the "black" class. 

Subsequently, the fuzzy classifier determines the condi- 
tional value for the condition relating to the pbcel being a 35 
contone image and the conditional value for the condition 
relating to the white value being greater than the predeter- 
mined white threshold. Upon determining these two condi- 
tional values, the fiizzy classifier determines the minimal 
value and assigns this value as the membership value for the 40 
pixel in the "white" class. 

The fuzzy classifier determines the conditional value for 
the condition relating to the pixel being a halftone image and 
the conditional value for the condition relating to the fre- 
quency value being less than a low frequency threshold 45 
value. Upon determining these two conditional values, the 
fuzzy classifier determines the minimal value and assigns 
this value as the membership value for the pixel in the "low 
frequency halftone" class. Thereafter, the fuzzy classifier 
determines the conditional value for the condition relating to so 
the pixel being a halftone image, the conditional value for 
the condition relating to the frequency value being greater 
than the low frequency threshold value, and the conditional 
value for the condition relating to the frequency value being 
less than a high frequency threshold value. Upon determin- 55 
ing these three conditional values, the fuzzy classifier deter- 
mines the minimal value and assigns this value as the 
membership value for the pixel in the "mid frequency 
halftone" class. 

The fiizzy classifier determines the conditional value for 60 
the condition relating to the pixel being a halftone image, the 
conditional value for the condition relating to the frequency 
value being greater than the high frequency threshold value. 
Upon determining these two conditional values, the fuzzy 
classifier determines the minimal value and assigns this 65 
value as the membership value for the pixel in the "high 
frequency halftone" class. 
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Lastly, the fuzzy classifier determines the membership 
value for the pixel in the "other*' class by determining the 
minimal value between 1-the "edge" membership value, 
1-the "black** membership value, 1-the "white" membership 
value, 1-the "pictorial" membership value, 1-the "low fre- 
quency halftone" membership value, 1-the "mid frequency 
halftone" membership value, and 1-the "high fi^quency 
halftone" membership value and assigns this value as the 
membership value for the pixel in the "other** class. 

By utilizing these processes, the fuzzy image classifier 
can eliminate problems with class switching in areas 
between two or more pre-defined types. In other words, 
these processes implement a fuz^y classification scheme 
which allows the defining of various image types or classes 
to fail gracefully. Moreover, by implementing a fuzzy clas- 
sification process, the fuzzy process allows the fuzzy mem- 
berships to have enhancement, different than any of the 
pre-defined classes. More specifically, the image processing 
system can choose between a continuum of possible pro- 
cessing operations to produce an output. 

As noted above, fuzzy classification avoids sudden tran- 
sitions in the classification domain, which results in mini- 
mizing abrupt transitions in density, A practical implemen- 
tation of fiizzy classification is only step-wise continuous 
(not truly continuous due to the discreteness of the data 
being analyzed). Thus, if the fiizzy classification is over little 
rectangles, the slight residual transitions will still be visible 
and objectionable in the rendered output image. On the other 
hand, a pixel -by-pixel without fuzzy classification will not 
have any visible artifacts — but will be very noisy or grainy. 
The transitions will be on an individual pixel basis, but the 
transitions will be perceived as huge. Such transitions would 
be equivalent to injecting huge amounts of noise into the 
image. 

However, as contemplated by the present invention, by 
combining fuzzy classification with a pixel-by-pixel classi- 
fication implementation, the overall classification system 
can prevent rectangular artifacts (using pixel-by-pixel 
classification) in the rendered output image and reduce the 
classification switching noise (using fuzzy classification). In 
other words, the present invention utilizes a fuzzy classifi- 
cation process wherein this process is appUed on a pixel- 
by-pixel basis and not on a window basis or globally. 

In other words, a window implementation would make a 
classification based on the contents of the window and 
assigned all pixels within that window the same image type 
classification value or classification membership vector 
value, whereas a pixel-by-pixel implementation classifies 
each pixel individually based on the contexts surrounding 
the pixel or the image values in a neighborhood of this pixel. 
Although it is desirable for the pixel to be centered, it is not 
necessary. For example, the pixel being classified can be in 
the lower right of the context. In the pixel-by-pixel imple- 
mentation the neighborhood moves along the fastscan direc- 
tion one pixel at a time and along the slowscan direction one 
scanline at a time, while the window based implementation 
moves along the fastscan direction to the next adjoining 
non-overlapping window of pixels and moves along the 
slowscan direction to the next adjoining non-overlapping 
window of pixels. 

Another practical advantage of a pixel-by-pixel basis is 
that the image can be classified and rendered in one pass. A 
windowed or global based approach requires the system to 
analyze the entire region (window or image) before making 
a classification decision. By then the video may be gone, llie 
system either has to store it, then process it through the 
renderer beginning after the system has analyzed the entire 
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region, or arrange for the image source to feed through the 
system twice. This increases system cost and may also slow 
down the system to throughput. Pixel based requires only 
a small context since the context is limited and always in the 
neighborhood of the pixel being classified. 

The present invention also allows the determination of 
output parameters such as filter coeflBcients and screening 
level given a fuzzy classification vector and a corresponding 
desired output for each prototype class. The present inven- 
tion can also accommodate output types that must satisfy 
certain constraints (such as the mean gain of the filter) 
through the use of the "other" class. The fuzzy image 
processing greatly attenuates switching noise common in 
crisp decision image segmentation processes. 

Processing image data for laser printer reproduction 
requires the implementation of two separate actions: image 
processing to enhance, through manipulation, which does 
not result in loss of information, such as filtering and TRC 
translation; and a lossy conversion of the resuhant image to 
a representation accepted by a print engine, normally a 
reduction in the number of bits per pixel through either 
application of a screen or error diffusion. By utilizing a 
fuzzy classification scheme, the present invention can easily 
unify the processing implied by all of the partial member- 
ships into a single processing action. 

For example, each of the classes, defined using fuzzy 
classification, is provided with an ideal processing scheme. 
This ideal image processing scheme would represent the 
image processing techniques used if the membership in that 
class was one (1.0) and all other classes had a membership 
of zero (0.0). The fuzzy image classification process, 
described above, provides the mechanism for determining 
the output for a given fuzzy membership because the output 
is a vector. Thus, fuzzy classification can be used to choose 
each element of the vector independently, thereby, as noted 
above, providing a continuum of possible image processing 
operations. However, this continuum is not directly condu- 
cive to determining screening operations and determining 
filter coeiEcients. More specifically, a screening operation, 
by its nature, is discrete; either a screen is applied or it isn't. 
At every point, a comparison is made between the image 
data (video level) and the screen value (screen level) to 
determine if a pixel is to be turned OFF or ON. Thus, on its 
face, screening would appear to be a poor candidate for 
fuzzy classification control because of its inability to be 
applied at differing levels. 

The other problem arises with determining filter coefiG- 
cients. Image processing requires that the filter coefficients 
sum to 1 (i.e., a constant grey level in produces the same 
grey level out). If there is the choice between several filters, 
their combination may no longer meet this requirement. 

The present invention resolves the first situation, the 
problem with applying a screen in a fuzzy classification 
environment, by replacing the screening operation with a 
linear operation that provides the same effect. Instead of 
comparing all video to a pre-defined screen, a 2-D sinusoidal 
type function, with a maximum amphtude at 45 degrees, is 
added to the incoming video. 

This screen, referred to as hybrid screening, creates video 
that is more likely to cluster output pixels in an error 
diffusion process. This clustering of video is the effect that 
is very pleasing to the himian eye in constant or near 
constant grey areas. It is the clustering effect that makes this 
screened output superior to a straight non-screened error 
diffused output. 

The frequency characteristics of the 2-D hybrid screen 
determines the approximate line pair per inch (Ipi) dot 
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pattern seen in the output; however, the ability to collect 
pixels into large dots, the desired effect in a screening 
process, is dependent on the screen's amplitude. This 
amplitude, a scalar, can be modified easily using the fuzzy 
5 image classification process of the present invention. All of 
the rules can assign a level (i.e., amplitude) of screening 
(large, medium, small or none, etc.). The size and frequency 
of the screen are predetermined to match the characteristics 
of the printer. 

jQ The image processing of an image using a fuzzy classi- 
fication process is very similar to the fuzzy classification 
process itself. For example, in the screening case, the fuzzy 
image processing method would establish three screening 
classes (large, medium, and none) to determine the screen 

j5 value to be applied to pixel. Each of these classes would 
have a set of rules as in the fuzzy classification method to 
determine the membership value for the pixel in the screen- 
ing processing class. The membership value would then be 
used to calculate the actual screen amplitude which will be 

2Q described in more detail below. 

In a preferred embodiment of the present invention, the 
mles for the screening classes are as follows: 

If (pixel is "edge" or pixel is "low frequency halftone" or 
pixel is "mid frequency halftone" or pixel is "other"), 

25 then screen is NO_SCREEN; 

If (pixel is "black" or pixel is "white"), then screen is 

MEDIUM_SCREEN; or 
If (pixel is "pictorial" or pixel is "high frequency 
halftone"), then screen is FULL_SCREEN. 

30 Referring to the multiple "If s" in the above rules, the 
membership value of image in the class "NO_SCREEN" 
would be equal to the fuzzy value of the predicate, fi^o^ 
5c/?E£AXscreen)=maxCa^^^^(pixel), /^/,,^/^^A.///o„.(pixel), 
/W^^.^A.z/^o,..(pixel).//^,A.r(pixel)); the membership value of 

35 image in the class "MEDIUM_„SCREEN" would be equal 
to the fuzzy value of the predicate, ^medwm^creej^ 
(screen)=max(My^^jt(ptxel), itV-Aii^Cpixel)); and the member- 
ship value of image in the class "FULL_SCREEN" would 
be equal to the fuzzy value of the predicate, ^full_screen 

40 (screen)»max(//^.,,,„-,Xpixel), hHishfr.^haiftarJ:?^^^'^))- 

To implement the fuzzy screening, the processing system 
determines the membership value of the pixel in each of the 
classes associated with a particular screening process and 
assigns the maximum value as the membership value in the 

45 screening class. For example, if the pixel had membership 
values for "edge," "low frequency halftone," "mid fre- 
quency halftone," and "other" of 0.6, 0.7, 0.2, and 0.3, 
respectively, the processing system would decode the mem- 
bership vector and assign a membership value to the 

50 NO_SCREEN class of 0.7. Moreover, if the pixel had 
membership values for "black" and "white" of 0.6 and 0.5, 
respectively, the processing system would decode the mem- 
bership vector and assign a membership value to the 
MEDIUM_SCREEN class of 0.6. Lastly, if the pixel had 

55 membership values for "pictorial" and "high frequency 
halftone" of 0,3 and 0.4, respectively, the processing system 
would decode the membership vector and assign a mem- 
bership value to the FULL_SCREEN class of 0.4. 
To determine the actual amplitude for the screen value, 

60 the fuzzy processing module, in the preferred embodiment 
of the present invention, multiply the membership value for 
each screen class with the ideal coefBcicnt for that class and 
divide the product by the sum of the membership values. In 
the preferred embodiment, the NO_SCREEN coefiBcient is 

65 0, MEDIUM_SCREEN coefficient is 0.5, and FULL_ 
SCREEN coefficient is 1.0. Thus, in the example described 
above, the screen amplitude for the pixel being processed 
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would be equal to a scalar value of 0.412 (((1.0*0.4)+ To solve this problem, the present invention, in the 

(0.5*0.6)+(0.0*0.7))/(0.4+0.6+0.7)). preferred embodiment, uses the weighted summation of 

An example of the application of fuzzy classification to several pre-defined filters to produced the filtered results, 

screening is illustrated in FIG. 8. As illustrated in FIG. 8, the These filters are filters associated with a particular filter 

video signal or image data is fed into a fuzzy classifier 80 5 class; i.e., one filter class is enhance, one filter class is 

which classifies the image data according to the rules lowpass, and another filter class is "other". The digital filter 

described above. The fuzzy image classifier 80 then gener- of the present invention takes the form of F =IiM,-F,- where all 

ates a membership vector which is passed onto a screening of the ¥,s correspond to the filters associated with each i"^ 

generating circuit 88. The screen generating circuit 88 class (or classes) and /^.corresponds to the membership of the 

produces a screen value which is added to the image data at i^age in the ith class, as determined by the fuzzy processing 

adder 89. The image data which is summed with the screen routine 

value corresponds 10 the same pixel as was classified by the ,„ , ' j^^g^, embodiment of the present invention, the 

fuzzy image classifier 80. n other words, the system^ lUus- ^^^^ ^j^^ 

are as follows: 

trated m FIG. 8 also mcludes buffers (not shown) to msure ir / ■ , • « ^ » • i • « • . • i» • i ■ «! 

that the pixel being processed corresponds to the correct ^"^^^ ^ "^^"L ^ ^''^ 

membership vector being used to define the processing is frequency halftone ), then filter is ENHANCE; or 

parameters. (pixel is "high frequency halftone"), then filter is 

A more detailed example of the application of fuzzy LOWPASS. 

segmentation to screening is illustrated in FIG. 9. As illus- Referring to the multiple "If s" in the above mles, the 

trated in FIG. 9, the video signal or image data is fed into a membership value of image in the class "ENHANCE" 

fuzzy classifier 80 which classifies the image data according 20 would be equal to the fiizzy value of the predicate, //j^a^c^- 

to the rules described above. The fiizzy image classifier 80 (filter)=max(^^rf^^(pixel), Mio^/rcqhotfto^eiV^^^^)^ Mpictonat 

then generates a membership vector which is passed onto a (pixel)); and the membership value of image in the class 

screening generating circuit 88. The screen generating cir- "LOWPASS" would be equal to the fuzzy value of the 

cuit 88 includes a screening weighting circuit 883, a screen predicate, fii^ownKsJfi^^^^)"^^^ (Mhighfn:qMaiftone(p^^^^))- 

value generating circuit 881, and a multiplier 885. 25 To implement the fuzzy screening, the processing system 

The screen weighting circuit 883 generates a weighting determines the membership value of the pixel in each of the 
factor in response to the values in the membership vector so classes associated with a particular screening process and 
as to produce a no screen, a medium screen, or a high screen, assigns the maximum value as the membership value in the 
or any screen therebetween, as discussed above. In other screening class. For example, if the pixel had membership 
words, if the pixel had membership values for "edge," "low 30 values for "edge," "low frequency halftone," and "pictorial" 
frequency halftone," "mid frequency halftone," and "other" of 0.6, 0.7, and 0.3, respectively, the processing system 
of 0,8, 0.7, 0.1, and 0.3, respectively, the processing system would decode the membership vector and assign a mem- 
would decode the membership vector and assign a mem- bership value to the ENHANCE class of 0.7. Moreover, if 
bership value to the NO_SCREEN class of 0.8. Moreover, the pixel had membership values for "high frequency half- 
if the pixel had membership values for "black" and "white" 35 tone" of 0.6, the processing system would decode the 
of 0.4 and 0.1, respectively, the processing system would membership vector and assign a membership value to the 
decode the membership vector and assign a membership LOWPASS class of 0.6. 

value to the MEDIUM„SCREEN class of 0.4. Lastly, if the To detenmine the actual coefi&cients for the various filters, 

pixel had membership values for "pictorial" and "high the fuzzy processing system must ensure that the fuzzy 

frequency halftone" of 0.2 and 0.0, respectively, the pro- 40 filtering, resulting from the rule set, meets the constraint of 

cessing system would decode the membership vector and a gain of 1.0 at the frequency (a)iC02)=(0,0). To alleviate this 

assign a membership value to the FULL_SCREEN class of problem, one of the output filter choices is assigned the 

0.2. bypass function. In bypass, Vout=Vin; i.e.; no filtering is 

Thus, in this example, the screen amplitude for the pixel done. Thus, the resulting filter, according to the concepts of 

being processed would be equal to a scalar value of 0.286 45 the present invention is F^^S/i,- F,-+(l-2/i,-)*F^ such that 

(((1.0*0.2)+(0.5*0.4)+<0.0*0.8))/(0.2+0.4+0.8)). when the desired effect is the bypass filter, the value goes to 

The screen value generating circuit 881, which may be a zero and the effect of the filter F,- is ignored, 

lookup table or a hardwired circuit, produces a screen value It is noted that the enhancement filter amplifies all higher 

based on the position of the pixel (image data) within the frequencies, and the lowpass filter attenuates the higher 

image. The weighting factor from the screen weighting 50 frequencies. The coefiBcient value, c, for the bypass filter can 

circuit 883 and the screen value from screen value gener- be determined using c^^^„=l-/^^^„„^^(filter)-/i/p^^„^_, 

a ting circuit 881 are multiplied together by multiplier 885 to (filter) so that the output filter can be described as 

produce the screen value to be added to the pixel. This ^o^f^enhance{^^^^r)*^enhance+f^io^pass{^^^^r)*^iowpass+ 

screen value is added to the image data at adder 89. ^ bypass*^ bypass- 

The image data which is summed with the screen value 55 An example of image processing an image using fuzzy 
corresponds to the same pixel as was classified by the fuzzy classification with respect to filtering the image data is 
image classifier 80. In other words, the system illustrated in illustrated in FIG. 7. As illustrated in FIG. 7, video data or 
FIG. 9 also includes buffers (not shown) to insure that the image data is fed into a fuzzy classifier 80, a lowpass filter 
pixel being processed corresponds to the correct member- 81, and an enhanced filter 82, and a bypass filter 83 in 
ship vector being used to define the processing parameters. 60 parallel. As described above, the fuzzy classifier 80 deter- 
As noted above, the problem of using a fuzzy image mines the membership vector of the pixel to be processed by 
classification system to control digital filtering is two-fold. the parallel set of filters. Note that this process includes 
First, digital filtering is not a scalar function, but a matrix buffers (not shown) to insure that the pixel being filtered 
function. Secondly, there is a constraint on the matrix corresponds to the correct membership vector being used to 
function; the filter must have gain of 1.0 at (cOjtuJK^iO)- 65 define the processing parameters. 

This ensures that constant grey areas are not altered by Based upon this classification, the fuzzy classifier 80 will 

filtering. cause the overall filter to generate a set of coefficients which 
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are applied to multipliers 84, 85, and 86. The coefficients Although the above described fuzzy classification system 

will enable the overall filter to weight the output firom the resolves some of the problems with pixel classification, the 

various filters according to the fuzzy image classification. method described to implement this system does not nec- 

For example, as noted above, if the pixel had membership essarily provide flexibility nor maximum efficiency. What is 

values for "edge," "low frequency halftone," and "pictorial" 5 desired is a segmentation architecture which allows flex- 

of 0.6, 0.7, and 0.3, respectively, the processing system ibility and efficiency such that the rules for classification can 

would decode the membership vector and assign a mem- be readily changed in the system as new information is 

bership value to the ENHANCE class of 0.7, which in turn uncovered about the nature of printed documents or a user 

is the filter coefficient assigned to the enhance filter 82 and desires to override a certain process on the fly, 

fed to multiplier 85. Moreover, if the pixel had membership lO One embodiment of the present invention provides an 

values for "high frequency halftone" of 0.6, the processing architecture which provides the above features. This archi- 

system would decode the membership vector and assign a lecture consists of imposing a rigid partitioning of the 

membership value to the LOWPASS class of 0.6, which in segmentation process into three parts. A better understand- 

turn is the filler coefficient assigned to the lowpass filter 81 ing of this concept can be seen in the illustration of FIG. 10. 

and fed to muhiplier 84. This leaves the generation of the 15 FIG. 10 illustrates a block diagram of a flexible 

coefficient for the bypass filter 83. classification/segmentation system. As illustrated in FIG. 10, 

As noted above, the generated coefficients need a rela- the video signal or image data is fed into a microclassifier 
lionship such that the sum of the coefficients will not be system 100, a macro-reduction system 200, and a pixel 
greater than 1 so as to keep the gain of the overall output classification look-up table 300. The microclassifier system 
from the filters to be equal to 1. Thus, in the preferred 20 100 is made of a plurality of microclassifier circuits 
embodiment of the present invention, the coefficient for the 101-104. Microclassifier circuits measure intrinsic proper- 
bypass filter is 1 minus the enhance coefficient minus the ties of the video extracted through a simple mathematical 
lowpass coefficient (in the example, l-0.7-0.6«-0.3). This formula or heuristic. Examples of microclassifiers might be 
coefficient is appUed to the muhiplier 86. The weighted filter detected video minimums or maximums, Laplacian, Sum of 
outputs are then fed into an adder 87 which adds all the 25 Laplacians, Average, Range of Raw Mdeo, Range of Aver- 
outputs to produce a filtered pixel of image data which has age \^deo, etc. as used in all of the references. The micro- 
been filtered according to its fuzzy image classification. classification values are fed into the macro-reduction system 

Although FIG. 7 illustrates the utilization of a fuzzy 200. 

classifier with a plurality of different filters, the fuzzy The many microclassifiers in this embodiment have dif- 

classifier can also be utilized in connection with a modular 30 ferent magnitudes when computed for images of different 

time variant two-dimensional non-separable filter wherein types. Table 1 lists qualitatively the magnitude of the raicro- 

the non-separable filter is made up of a plurality of separable classifiers for different image types, 
filters. 

The utilization of separable filters allows a non -separable TABLE 1 

filter of a particular form to be implemented in a cost 35 

effective manner. Moreover, a separable two-dimensional „ ,^ ^ ^ ^ ^ 

, , , 1' • 1 r»i . * Halftone tone Text Qintone Contone Contone 

filter can be descnbed as one-dmnensional filter that acts in shadow Edge Gray White Black 
the vertical direction or slowscan direction followed by 



another one-dimensional filter acting in the horizontal or ^^^^^^^ ^^y^^e Varying \^rying Varying High Low 
fastscandirection.Thus,thefiltermatrixcanbedescribedas40^^^^°- ^^^^^^^^^^^^ 

the product of F^;,of^*f;, wherein F^^ is an N by M matrix, Sum of High Low Highest Low Low Low 

f ^ is a N length column vector (the vertical filter) and f;, is absolute 

a M length row vector (the horizontal filter). ^x^^ y* 

If the matrix F,;, cannot be represented using the above ^^^^J^ Highest High Highest Low Low Low 

equation, the matrix is considered non-separable and the 45 vidco in 

separable filter implementation cannot be used. However, if 5x5 

the N by M matrix is represented using a singular value 

J ... i_ T- TT*o±\7 i_ IT- KT i_ KT *X « 7, 11 and Y = 3, 5 for resolution of 400 spi and 600 spi, rcspec- 

decomposition such as F^=U*S*V, where U is N by N ^j^^jy r k 
unitary matrix, S is an N by M diagonal matrix, and V is a 

M by M unitary matrix; a separable filter implementation 50 The magniUide of the responses of the microclassifiers 

can be used. Furthermore, if N and M are not equal, the shown above suggests the following microclassification 

above equation can be altered to F^^=U^*S^*V^, where niles expressed in a pseudo code. 

Q=min(n,m), U, is a N by X submatrix of U, is a Q by Vij«Video of the j-th pixel in the i-th scanline. 

Q submatrix of S, and V^is a M by Q submatrix of V. Putting Nij-Peak Count centered at the j-lh pixel in the i-th scanUne. 

this in summation form, the resulting representation will be 55 Sij=Sum of Absolute Laplacian of \^deo at the j-th pixel in 

F^^«»2s(i)*U;*v^., where i is greater than or equal to 1 but less the i-th scanline. 

than or equal to Q. Rij=Range of Video in 5x5context centered at the j-th pixel 

In this representation, the vector s(i) is a diagonal of the ^ ^"^^ scanline. 
matrix u^ is the ith column vector of U^, and v,- is the i*^ N16>N15> . . . >Nl«Thresholds in Peak Counts for Half- 
column vector of V^. Each component is a separable filter 60 loo^s of classes with firequencies HFri6>HFT15> . . . HFTl 
similar to that described above with the exception of a gain S3>S2>Sl=Thresholds in Sij. 
factor s(!). In other words, any non-separable filter of length Rl ^Threshold in Rij . 
N by M can be represented as the weighted summation of Q V2>Vl«Thresholds in Vij. 
separable N by M filters. Thus, to implement a non- If (Nij>oNl AND Sij>«S2 AND Rij>«Rl { 
separable Alter using the weighted averages of several of the 65 If (Nij<N2) 
fillers, the hardware becomes a conglomeration of Q sepa- Class«HFTl 
rable filter modules, Q multipliers, and an adder. else if (Nij<N3) 
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Class«HFT2 
else if (Nij<N4) 

Class-HFT3 
else if (Nij<N5) 

Class-HFT4 
else if (Nij<N6) 

Class-HFT5 
else if (Nij<N7) 

ClassoHFT6 
else if (Nij<N8) 

Class^HFH 
else if (Nij<N9) 

Class=HFT8 
else if (Nij<N10) 

Class»HFT9 
else if (Nij<Nll) 

Class=HFT10 
else if (Nij<N12) 

Class^HFTU 
else if (Nij<N13) 

Class=HFT12 
else if (Nij<N14) 

Class=Hm3 
else if (Nij<N15) 

Class=HF14 
else if(Nij<N16) 

Class=HFT15 
else 

aass=HFn6 
} else if (Sij>=S3) { 

aass=EDGE 
} else { 
if(Vij<Vl) 

Class=BLACK 
else if(Vii>-V2) 

Class-White 
else { 

if(Sij>=Sl) 
Class=CONTONE ROUGH 

else 

Class=CONTONE SMOOTH 

} 

} 

In the microclassificatioa rules shown above, the halftone 
class is divided into 16 frequency intervals as an example. 
The multiple halftone classes allow an adaptive filtering and 
rendering of the halftone areas. It eliminates classification 
artifacts resulting from simply dividing the halftone classes 50 
into high and low frequency halftones only. The continuous 
tone is also divided into two sub-classes, smooth and rough, 
based on the sum of the absolute Laplacian. This allows for 
more optimized rendering of the continuous tone area. More 
specifically, the multiple frequencies are used to realize the 55 
advantages of a true fuzzy classification system through a 
discrete implementation. The subclasses are created as a way 
to gradate the overall classification, thus reducing the need 
for fuzzy classification. In a classification fuzzy system, 
there would be fewer classes, for example low, medium and 60 
high frequency. 

A variant of the above microclassification rules is 
obtained by replacing the statement "else if(Sij>=S3)" with 
"else if(Sij>=S3 AND Rij>=Rl)" to obtain a more robust 
determination of the edge class membership value. 65 

The macro-reduction system 200 is made of a plurahty of 
macro-reduction circuits 201-204. Each macro-reduction 



30 



35 



40 



circuit reduces microclassification values received from the 
microclassifiers to produce higher level, more directly useful 
information through mathematical operations and heuristics, 
suppressing noise and other irrelevant or undesirable varia- 
tions in the microclassifiers. These are of^en non-linear 
operations. Examples of macro-manipulations or reductions 
might be to count video minimums or maximums in the 
neighborhood of a pixel, implementing a running average 
with a controllable attack, decay, and heuristic conditions, 
performing an auto-correlation on detected peaks, or com- 
bining two or more microclassifiers to compensate for 
undesirable fundamental effects. The macro-reduced values 
are fed into the pixel classification look-up table 300. 

The pixel classification look-up table 300 reduces the 
macro-reduced outputs to the final classification through a 
run-time programmable mechanism. The look-up table 300 
also enables any arbitrarily complex function to be 
performed, since the arbitrarily complex function can be 
precalculated off-line and loaded into the table. Microclas- 
sification values or even the input video may also be used to 
directly drive the programmable table 300 if appropriate. 
Use of a programmable table enables quick modification of 
the classification algorithm when requirements change or as 
understanding the properties of images grows, albeit within 
the framework presented at the input of the table. 

This architecture can be further modified to add more 
flexibility as illustrated FIG. 11. As illustrated in FIG. 11, a 
programmable logic array 400 is placed between the micro- 
classifier system 100 and the macro-reduction system 200. 
The programmable logic array 400 includes a plurality of 
circuits, gates, and multipliers which is programmed to 
control which microclassification value goes to which 
macro-reduction circuit and what is the weight of that value. 
This programmable logic array 400 enables the user or 
technician to reprogram the overall segmentation algorithm 
by allowing for the programmability of the feeding of the 
microclassification values to the macro-reduction circuits. 
Such a programmable logic array could also be included 
between the macro-reduction system 200 and the program- 
mable look-up table 300 to add maximum flexibility. 

In any of the architectures described above with respect to 
FIGS, 10 and 11, each stage can be easily scaled and 
interactions added or removed as requirements change. 
Microclassifiers can also be removed to save cost or added 
to extract new kinds of information from the document. 
Macro-reductions can be removed to save cost or added 
when new interrelationships are discovered or better meth- 
ods of reducing the microclassifier outputs are developed. 
The lookup table can be expanded to bring a higher level of 
programmability up to the user or reduced to save cost. 
Lastly, such architectures enable classifications to be 
dynamically added, removed, or their defining attributes 
modified both between and within pages of a multiple page 
image processing job. 

As noted above, various types of information about the 
image is needed to provide a proper image type classifica- 
tion of the image data. The information varies from deter- 
mining the peaks in the video stream to determining the sum 
of the absolute values of the Laplacian of the video stream 
One important tool in determining some of the needed 
information is filtering. Filtering has been utilized in various 
segmentation systems. Such filtering is essential in the 
measurement or estimation of the halftone frequency of the 
video signal or image data. 

As noted above, methods for classifying image pixels 
according to document types; e.g., text, continuous tone, and 
halftones of various techniques and frequencies; make deci- 
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sions based on image context in the vicinity of the pixel 
being classified. A critical piece of information extracted 
from the context is the estimate of the halftone frequency. 
The frequency is commonly estimated from a count of 
halftone peaks within the context. Such a count is really ^ 
simply the application of a two-dimensional blur filter with 
rectangular amplitude profile in the space domain. 

One embodiment of the present invention is directed to an 
efficient and inexpensive method for implementing a trian- 
gular blur filter to produce either a low pass or a high pass 
fiher The high pass filter is derived by the usual conven- 
tional method of subtracting the low pass output from the 
original input. The implementation of the present invention 
requires only two adders and two delay elements and rio^s 
multipliers. In addition, the present invention provides for- 
inexpensively re-synchroniging data , which is delayed jiift> 
to the filtering, wit^any associated i nput tags wpcrem tags 
"Can be any auxiliary aata associatea with the im age data or 
V ideo sireain, M\ as Image dIii>^>m(i alion inlormahon! 20 
^ In implementing the filter, the present invention redun- 
dantly performs the calculation on the trail edge instead of 
storing the result to use it later This is only done in the 
slowscan direction. In the fastscan direction, the results are 
stored in a series of latches. The reasoning behind this 
implementation will become more clear from the description 
below. 

In the fastscan direction, the "calculation" is that of taking 
gray input video and detecting peaks, yielding a 1-bit output. 
Conventionally, it is a very expensive operation because it 
requires a large number of gates. On the other hand, it is 
inexpensive to delay this one bit output by a few pixels 
(8-18) with flip-flops, so it is delayed. In the slowscan 
direction, the "calculation" is a peak counter within an 
one-dimensional window, inputting a 1-bit number and 
yielding a 4-bit number. This 4-bit result would have to be 
delayed 8-18 scanlines, times some 5000 pixels per 
scanline, costing several hundred thousand flip-flops, or 
more likely an external RAM and control circuitry. In view 
of this inefficiency, the present invention duplicates the 
first-pass block to recreate the result when and where needed 
during the second pass rather than using the conventional 
method of taking the output of the first pass and providing 
it as input to the second pass as two pure blocks. 

"For many different applications, it is desirable to filter a 
given input signal or image. TypicaUy for signal and image 
processing applications, finite impulse response filters are 
preferable because of several properties these filters pos sess, so 
specifically, linear phase and guaranteed stabihty. In addi- 
lion to , the input/video that is to be filtered, additional 
properties/data associated with- the input signal/video may 
be coupled with the signal/video as it flows through the 
system. This additional information may be processed by or 
control downstream modules. This additional information 
often contains information corresponding to the classifica- 
tion of the individual pixels of the signaWideo, such as 
membership information, and may be referred to as the 
signal/image tag or a classification vector. In the description 
below, the tag at a certain lime/position k will be referredTft 
as T(k). Thus, the entire input to be processed is the union 
of both the signal/video plane and one or more tag or 
classification vector planes. 

For the application of filtering, it is desirable to filter the 65 
signal/video input and then lake the corresponding output 
and reunite it with the appropriate tags. An example of such 
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an application of filtering can be expressed utilizing the 
following equation for a rectangular filter of length 2N+1. 



H(z)-3t:;Z*V={-N,-N+l, . . . 0, . . . N-1,N} 
y(k)-2ptOc+i),i-{-N, -N+1, . . . 0, . - . N-1,N} 

In these equations, H(z) is the z-transform of the filter and 
x(k) and y(k) are the inputs and outputs of the filter, 
respectively. Although this filter has poor frequency 
response characteristics, it is often used because it is very 
inexpensive to implement. 

In order to implement a rectangular filter in real time and 
inexpensively, the following rectangular filter is convention- 
ally utilized. 

HCz)-(i-z-^-^)/(z-^-z-*-') 

y(k-N)=yCk-N-l)+x(k)-x(k-2N-l); 

To get the present sum fi-om the above equation, the 
previous sum is used and one video input is subtracted while 
another is added. In addition, none of the terms H(z) are 
functions of positive exponents of z and thus they do not 
depend on future values of x, only on the present and 
previous values. This type of rectangular filter makes it easy 
to implement in real time and/or in pipeline systems where 
all of the data is not available at once but is presented serially 
to the filter. Unlike a typical filter, which requires several 
values of the input and several multipliers/adders, all that is 
needed to implement this rectangular filter is a delay element 
of 2N+1 and two adders. 

Unfortunately, it is noted that at time (or position) index 
k, the output is known up to time k-N. At this time however, 
the tag or classification vector is at time k, T(k) is the value 
available in the pipeline. In other words, the output video 
and the associated lag or vector have become unsynchro- 
nized. In order to re-synchronize the output signal/video 
with the proper tag, the tags must be delayed by N sample 
times. The overall configuration therefore requires one sig- 
nal to be delayed by N (the tags), and another signal be 
delayed by 2N+1 (the video). If expensive first-in first-out 
(FIFO) buffers are utilized for these delays (as in the case of 
two-dimensional image processing), the expense of the two 
delay blocks can be very large. In addition, such an imple- 
mentation would require two different lengths of FIFO 
buffers or two identical FIFO buffers controlled differently. 

Furthermore, a single rectangular filter is not the optimal 
fiher to utilize because its side lobes in the frequency domain 
are very significant. Thus, to create a better filter, one 
embodiment of the present invention utilizes a rectangular 
filter that is used successively to provide an equivalent 
triangular filter wherein the triangular filter is generated by 
the convolution of two rectangular filters. Such a triangular 
filter can be expressed utilizing the following equations. 

z(k-2iN-l)-z(k-2N-2)+y(k-N-l)-y(k-3N-2); 

y(k-N)=y(k-N-l)+x(k)-x(k-2N-l); 

In the above equations, y(k-N) is a first state sum as in a 
rectangular fiUering and zQc-2N-l) is the final output from 
the triangular filter. This triangular filter equation can be 
reduced to: 

z(k)-Z(2N+l-[i-2N-lD x(k+i-2N-l),i-0 to 4N+1 

The triangular filter has superior frequency properties 
when compared to a rectangular fiUer ascleariy illustrated in 
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FIG. 21. FIG. 21 shows the Fourier transform of both filters 
when each is 15 elements long. It is noted that to implement 
this triangular filter, the triangular filter requires four adders/ 
sublractors. In addition, the triangular filter requires the use 
of two delay blocks instead of one but both have a fixed 5 
delay of 2N+1. If a single delay block is wide enough, it can 
be used to delay both the tagsA^ectors and the signal. In other 
words, instead of two delay blocks, a single block of twice 
the bit width can suffice. 

Additionally, the triangular filter has the property that the lO 
overall delay between the input and output is 2N+1, unlike 
the delay of N in the rectangular filter. Thus, utilizing a 
triangular filler requires that the tags l(k) be delayed by 
2N+1 to be re-synchronized with the output video. This 
delay is identical to the delay needed for the input signal 15 
x(k), an intermediate state y(k) in the equation above 
describing the triangular filler. Therefore, the same type of 
delay block can be used for the image elements as well as the 
tags. If the wordlength of the delay block is large enough, a 
single delay block can be used to delay x(k), y(k), and t(k). 20 
The triangular filter can be implemented using only a single 
delay element of 2N+1 states with four adders and sublrac- 
tors. In addition, one multiplier can be used to normalize the 
fiher so that it is low pass in nature with a unity gain. The 
normalization can also be done at the end of the first slate: 25 
i.e., on y(k) lo reduce the bit to be carried into the second 
state sun. 

With only minmal additional hardware, a rectangular filter 
can be easily replaced by a triangular filter, which has better 
frequency response characteristics. Moreover, the tags in the 30 
signal can be readily re-synchronized using common hard- 
ware to that which is needed by the filter itself. In addition, 
the triangular filter can be used to implement a high pass 
filter as well. The signal/video input can be delayed along 
the tag vector so that at every index k there is the original 35 
signal, the low pass signal (via the triangular filter) and the 
tags. The high pass filter output is easily created by sub- 
tracting the low pass output from the original input video. 

This process can be repeated to provide "smoother" low 
pass filters wherein the filter becomes Gaussian as the 40 
procedure is infinitely repeated. The cost per stage is linear, 
but the incremental benefits of the low pass filter and the 
frequency response decrease rapidly with each additional 
stage. 

Finally, the process discussed above can be extended to 45 
two dimensions by using a triangular filter in the horizontal 
direction and a triangular filter in the vertical direction. An 
example of a typical application of a two-dimensional filter 
is the utilization of a two-dimensional rectangular function 
to estimate the spacial frequency (measured in lines per inch, 50 
or LP!) of a halftone image given the peak map of the image. 
More specifically, such an estimation of the frequency is 
needed in any process requiring automatic halftone detection 
such as automatic image segmentation. In such an 
application, a filter is used to process a peak map and 55 
thereafter each point is converted to a frequency estimate by 
taking the square root of the peak count within a context 
centered on the pixel whose containing halftone cell's 
frequency is being estimated and multiplying it by a constant 
which is a function of the filter size. An ideal frequency 60 
measurement map would have a standard deviation of 0.0 
since the input image is a constant halftone. 

As discussed earlier, the triangular filter can be imple- 
mented as the convolution of two successive rectangular 
blur filters. This implementation has the advantage of main- 65 
taining synchronization between the video being filtered and 
any information in a parallel channel, such as classification 



tags. Therefore, determining the optimal implementation of 
a two-dimensional triangular filter reduces to the problem of 
determining the optimal implementation of a two- 
dimensional rectangular blur filter. Furthermore, while the 
desire to minimize cost and complexity drives toward fil- 
tering of tiled rectangular blocks at fixed locations in the 
image, this leads to rectangular artifacts in the classification 
map, which in turn results in objectionable rectangular 
artifacts in the resulting image whose processing is driven 
by this map. This understanding adds the additional require- 
ment that the optimal implementation of a two-dimensional 
triangular filter must filter each pixel based on a context 
which is centered on that pixel. 

The present invention provides a fast, hardware efficient 
method of achieving the goals outlined above: [1] Imple- 
menting two concurrent two-dimensional rectangular blur 
filter on a raster image flowing through a pipeline, yielding 
a triangular filter, [2] Filtering based on a context centered 
on each pixel being filtered, thereby avoiding rectangular 
artifacts, and [3] Maintaining synchronization of the filtered 
output image with a unfiltered auxiliary data channel. In the 
event that the data needs to be processed through additional 
rectangular blur filters to obtain better frequency 
characteristics, only one set of control circuitry is required 
for all instances. 

To better explain the two-dimensional rectangular blur 
filter of the present invention, FIGS. 22 through 24 will be 
described below. 

FIG. 23 is a block diagram of a circuit which detects the 
peaks of the video stream and counts the number of peaks 
within a programmable neighborhood around the pixel being 
classified. The video stream is received by a peak detection 
circuit 600 which detects the peaks within the video stream 
utilizing conventional peak detection methods. This circuit 
outputs a binary 1 if a peak is detected and a binary 0 if no 
peak is detected. The peak detection information is fed into 
a peak counter circuit 500 wherein the number of peaks in 
the context neighborhood is determined. 

The peak counter circuit 500 includes a programmable 
delay line 602 which provides trail edge peak detection 
information. In other words, to determine the number of 
peaks in a neighborhood that is moving as the classification 
process goes from one pixel to the next pixel, the counter 
must not only count the incoming peak detection 
information, but must also remove the peak information 
relating to pixels now outside the neighborhood. The pro- 
grammable delay line 602 provides the trail edge peak 
detection information which is the peak information relating 
to pixels now outside the neighborhood. The delay line 602 
is programmable so that the system can be programmed to 
implement varying target neighborhood sizes based on 
image resolution, apriori knowledge of the document being 
segmented, and system level tradeoffs. 

The trail edge peak detection information, along with the 
current peak detection information from the peak detection 
circuit 600, is fed into a peak counter 604 which adds the 
current peak detection information to a current count, pro- 
vided by a latch circuit 606 and subtracts the trail edge peak 
detection information provided by the programmable delay 
line 602 from the current count. In this way, the peak counter 
604 can provide an accurate current count to the latch 606. 

One way to implement a two-dimensional rectangular 
blur filter using the peak detection/counter circuitry of FIG. 
23 is illustrated in FIG. 24. The video stream, as illustrated 
in FIG. 24, is received by a peak detection circuit 600 which 
detects the peaks within the video stream utilizing conven- 
tional peak detection methods. This circuit outputs a binary 
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1 if a peak is detected and a binary 0 if no peak is detected. 
The peak detection information is fed into a fastscan lead 
edge peak counter circuit 608 wherein the number of peaks 
in the context neighborhood is determined. The fastscan lead 
edge peak counter circuit 608 contains the same components 
as the peak detection circuit 500 of FIG. 23. 

Since the filtering needs to be done in two dimensions, the 
peak information within a context extending in the slowscan 
direction must also be included in the peak count determi- 
nation. To facilitate the inclusion of slowscan peak count 
information, a slowscan trail edge FIFO buffer 610 is 
included. A slowscan lead edge peak counter is not required 
because the scanning on the image data is along the fastscan 
direction such that when the scanning moves to the scanline, 
the peak information for that scanline will be provided by 
the fastscan lead edge peak counter 608 as it scans across the 
image data, whereas the slowscan trail edge FIFO buffer 610 
will provide the slowscan peak count information that needs 
to be removed from the accumulated count because that 
scanline is no longer part of the neighborhood being ana- 
lyzed. This FIFO buffer 610 must be large enough to store 
information for each pixel in the fastscan direction (typically 
on the order of 2,500-10,200 pixels) and for the number of 
scanlines that define a neighborhood. 

The peak count information from the fastscan lead edge 
peak counter 608 is fed to a sumner 612 and the slowscan 
trail edge FIFO buffer 610. The summer 612 adds the peak 
count information from the fastscan lead edge peak counter 
608 to a previous count value for that pixel column that was 
stored in a scanline FIFO buffer (not shown) to generate the 
real count value for the pixel being classified. This real count 
value is also fed into a summer 614 wherein the real count 
value is modified so that the peak count information from the 
slowscan trail edge is removed. This modified value is stored 
in the same scanline FIFO buffer. 

The system in FIG. 24 produces a two-dimensional rect- 
angnlar blur filter that eliminates rectangular artifacts 
induced by the filter, but requires large amounts of high 
speed memory, thus having a negative impact on its cost. 
FIG. 22 illustrates another implementation of a two- 
dimensional rectangular blur filter which does not require a 
large amount of memory and produces identical results. 

As illustrated in FIG. 22, peak detection information from 
a peak detection circuit (not shown) is received by a latch 
501. The latched information is fed to a fastscan trail edge 
peak counter 503 which has essentially the same contents as 
peak counter 500 in FIG. 23. The trail edge count informa- 
tion is fed into a latch 504 and this latched information is fed 
into a summer 505. 

Peak detection information from a second peak detection 
circuit (not shown) is received by a latch 513. The latched 
information is fed to a fastscan lead edge peak counter 514 
which has essentially the same contents as peak counter 500 
in FIG. 23. The lead edge count information is fed into a 
summer 510. Summer 510 also receives latched peak count 
information fi-om latches 508 and 509. This latched infor- 
mation represents the old count value that had been stored in 
a scanline FIFO buffer (not shown). The resulting sum from 
summer 510 is the current count value for the pixel being 
classified as it leaves the latching circuit of latches 511 and 
512. 

The resulting sum from summer 510 is also fed to summer 
505 through latch 507 wherein summer 505 subtracts the 
latched count value of the fastscan trail edge counter 503 
from the latched summed value from summer 510. TOs 
value is latched by latch 506 before being stored in the 
scanline FIFO buffer. It is noted that the various latches in 
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FIG. 22 are merely for keeping the various counts for the 
active pixel column properly synchronized with the pixel 
image data, and thus, these latches do not go to the crux of 
the invention since more or less latches may be needed in 
5 order to maintain this synchronization depending on detailed 
design details. 

The operations of the two-dimensional rectangular blur 
fiher as illustrated in FIG. 22 will be more precisely 
described below. 

jQ In order to calculate the output of the blur filter for a 
context trail edge pixel position in both the fastscan and 
slowscan directions, two fastscan blur filters (one at the 
slowscan lead edge of the context and one at the slowscan 
trail edge of the context) feed P slowscan filters wherein P 

35 is the number of pixels per scanline in the fastscan direction. 
The state of the P slowscan blur filters is stored in a ring 
buffer and restored, updated, and saved every clock cycle as 
each pixeFs contribution is incorporated. The details of the 
process are as follows: 

20 The filter of the present invention blurs pixels in a 1 pixel 
slowscan by N pixel fastscan strip as it slides in the fastscan 
direction at the slowscan lead edge of the incoming image: 
Given: 

N: Fastscan length of the blur filter 
25 X: Current pixel position on the scanline. 

Wx' Input video value at the current pixel position. 
FSPS;^.: Fastscan Partial Sum consisting of the sum of 

the N-1 input pixels preceding the current pixel on 

the scanline. 

30 The blur filter output consisting of the sum of the N pixels 
trailing and including the pixel at position X is FSBLUR;|r« 
FSPSjjH-y;^^ and the new partial sum for the next column 
position is: FSPS;^.^joFSBLUR;^^V;4._(^.j). The N-1 trailing 
video values are saved in a FIFO P elements long so that 

35 random and repeat access to the image is not required. At 
each pixel location ^x-iN-i) ^ immediately available from 
the output of the FIFO and V^^, is written to the input. 

An output image consisting of the fastscan blurred video 
can then be piped into a similar process to perform the 

40 slowscan blurring. However since the data is presented in a 
fastscan/slowscan raster format in order to blur each column 
by M pixels in the slow scan direction, M full scanlines of 
(BPP+fLog2 N]) bit wide storage would be required wherein 
BPP indicates the number of bits per pixel. Instead, the 

45 original BPP bit wide input data can be delayed and pre- 
sented to a second fastscan blur block identical to that 
described above. This second fastscan blur can concurrently 
recalculate the required quantity (FSBLUR;^.^y_(;v^_jj) in the 
description below) for use by the slowscan blur operation. 

50 In the event that the original BPP bit wide input data is 
already being delayed and presented at the slowscan trail 
edge for other reasons, this in effect saves all M scanlines by 
(BPP+[Log2 Nl) bits of storage. For example, if BPP=1 and 
N=16, 5 bits wide by M scanlines of storage are saved. If 

55 multiple instances of the rectangular filter are applied, [Logj 
N] bits by M scanlines of storage are saved over the 
(BPP+fLog2 N]) required where the fastscan blurred video 
is simply piped to a second stage. This process is described 
in detail below. 

60 For each column in the fastscan blurred image, the filter 
of the present invention blurs a M pixel slowscan by 1 pixel 
fastscan strip as it slides in the slowscan direction at the 
slowscan lead edge of the incoming image. 
Given: 

65 M: Slowscan length of the blur filter 
X: Current column position 
Y: Current scanline position 
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FSBLUR;^ y: Value at the current column/scanline posi- to overcome this problem, one can require that the pixel is 
tion of the fastscan blurred input video greater than or equal to all of its neighbors. The problem 
FSBLUR;t-^i._^A/-i))' Value M-1 scanlines up from the associated with this implementation is that the flat top area 
current position of the fastscan blurred input video. may be present as multiple pixels. As an example, utilizing 
SSPS^ Slowscan Lead Edge of context Partial Sum 5 a peak function P(i, j) which is one where there is a 
consisting of the sum of the M-lfastscan blurred maximum, and zero otherwise, a typical peak detection 
values in a column above the current position. process for pixel in the i-th scanline j-th pixel location is as 
The slowscan blur filler output consisting of the sum of follows: 
the M Fastscan blur filter output values trailing and includ- 
ing the pixel at position X,Y is SSBLUR;^_y=SSPS;^.y+ Peak(ij>iifiv(ij)>v(i-k.j-i), wth -i^k. 1^1 and k and | are 
FSBLUR;^ y, and the new partial sum for the next row, same not both zero, otherwise 0 
column position is SSPSj^.(y^ij"SSBLUR;4, j^FSBLURj^-^y. 

(Af_i)). ' ' ' If there are dual peaks, for example: v(i,j)=v(i+l J), nei- 

In the fastscan case, the new partial sum would be used on Iher (i j) or (i+lj) will be considered locations of a local 

the next pixel processed. However in the slowscan case, maximum. This process will therefore tend to underestimate 

because of the faslscan/slowscan raster presentation of the the number of local maximums of the image. In halftone 

data, the column-wise partial sum must be saved until the measurements, this would result in a frequency estimate 

same column in the next row is processed. Since each value iQ^^^r than the true frequency. Similarly, if the strict inequal- 

is created only once, used only once, and every value is used jty „j ^^^^ equation set forth above is changed to a weak 

P-1 pixels after it is created, the column partial sums can be mequaUty (^), aU flat top peaks will be counted multiple 

saved m a HFO memory. For each column, the partial sum 20 ^-^^^^ j^^j overestimate the number of local 

for the current position is stored during the previous row. ^jaximums in the image. 

This value is read from one end of the FIFO, and the new ^^^^^ ^^^^ problems adjacent peaks can be 

partial sum calculated for the current column is wntten to the disallowed where the peak pixel and aU intervening pixels 

other. The next HFO read wiU be the partial sum for the next ^^^^^ j^^^j ^ ^^^^^^^ ^^fo^^^. 

column position. These operations are saving and reslonng 25 increases the complexity of the process and requires 

the state of the P slowscan blur filters which exist for each ^^^^ j^^^, ^ ^^^^^^ ^^^^^ calculations. In 

of the P pixel columns in the image. ^^^^ implementation, this is costly because storing hne 

It is noted that while the overall two-dimensional blur ^^^^^^ real-time calculations is expensive in high speed 

filter requires storage for M-1 scanlines of the onginal systems 

image, access is only needed at the upper and lower right 30 embodiment of the present invention resolves this 

corners of the blur rectangle. A HFO memory can be used, ^^^^^^^ ^ combination of the strict and weak inequal- 

thereby reducing control circuitry and Input/Output pms -jy implementation of the weak and equality can be 

required to a minimum: data mput/output and simple read/ direction sensitive. It is only imposed at a particular direc- 

write signals. If multiple passes through a rectangular blur ^ ^own and/or to the right. In this case, the inequal- 

filter are required to achieve a frequency response with 35 ^^^^^ equation is changed to: 
lower side lobes, a second instance of the process described 

above can be implemented concurrently, with the M scanline Peak(ij)=iiff 
FIFO memory element and the per column accumulator 

storage being made wider but using the same control cir- v(ij)^v(i+k,j+i), w-ith (k,i)-{(o,i),(i -i),(i,o),(i,i)} 

cuitry as the first stage. In this case, the present invention's 40 v(ij>v(i.kj>i), with (k,i)»{(-i,-iX(-i,0X(-i,i),C0,-i)} 

use of a duplicate fastscan blur block m the second stage ^ v j 

saves fLog2 M*Nl bits in width of the M scanlines of xhis inequality will count multiple peaks only one time, 

storage. If N equals 10 and M equals 15, the M scanline since it only allows it as it approaches from only one 

FIFO memory element width required by the second filter is direction. In this example, it will consider the upper left to 

reduced by \L0g2 15011«8 bits over the conventional 45 be the peak location. Unfortunately, this process has also a 

sequence approach of implementing a separable two- shortcoming. The process will detect peaks in areas where 

dimensional filter staggered lines are encountered. The upper left area of each 

Advantages from the above-described 2-D blur filter jagged section will be detected as a local maximum. This 

implemenution are that access to the data is required only type of staggering is, for example, in text areas where 

after a single fix delay, thereby allowing the use of FIFO so diagonal lines are present such as in part of the letter A. In 

storage elements. Rephcation of critical functions reduces many instances, the detection of these areas as peaks many 

the required width of the FIFO storage elements. Regularity not be undesirable. For halftone detection, however, it may 

in the process utilizing the rectangular blur filter allows result in a false detection of a halftone area, 

multiple instances of the filter to be ganged together, thereby Iq order to eliminate detecting local maximums in this 

sharing control circuitry in processing the data concurrently 55 area, the second equation above is updated to place the 

to realize multiple passes through the filter without a cor- diagonal inequality in the upper left. This eliminates block 

responding increase in cost and complexity. areas that are typically present in text. The updated peak 

Moreover, peak detection routines are often used in image detection is as follows: 
processing to help find maximum and minimums of the 

images. In halftone detection algorithms, the peak detection 60 Peat(ij)-iiff 

is often used in determining the dot density or frequency of r v , 

the halftone being measured. This is a common measure- v(ij>-v(i+kj.i). with (k,l>.{(-3,-lM-i,iMO,3),(l,0)} 

ment used in segmentation. In many detection routines, the v(ij)>vCi+kj+i), with (k,i)-{(-i A(0 -iXO,!)} 
maximum is found by searching for a local pixel that is 

greater than all its neighbors. 65 If this peak detection is utilized on a halftone gray original 

The problem with this process is that it ignores flat top at 100 lines per inch with 45 degree orientation. Under ideal 

peaks; i.e., peaks that are more than 1 pixel in width. In order peak detection, the peak map will form a hexagonal grid. 
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Thus, given perfect peak deieciion, the firequency measure- 
ment would have a mean of 100 and a standard deviation of 
0. Using the first described peak detection process above, the 
mean of the frequency measurement would be 91.5 lines per 
inch, the range 61 lines per inch to 100 and 9 lines per inch 
and the standard deviation 5.6 lines per inch. Using the third 
described peak detection, the mean frequency is 99 lines per 
inch, range 91 lines per inch to 111 lines per inch and the 
standard deviation is 2 lines per inch. 

As noted above, the detection of halftones is very impor- 
tant. The detection of halftones involves finding the center of 
the halftone dots and counting the number of dot centers in 
a small window. The presence of enough numbers of dots as 
well as additional conditions qualifies an area as a halftone. 
The frequency of the halftone can be inferred from the dot 
counts in the window of a known size. The dot centers are 
identified by finding the local video minimums in the 
mid-tone to highlight area and finding the local video 
maximums in the mid-tone to shadow areas. To suppress 
false detections in a noisy non-halftone area, the center pixel 
is required to be less than the average of the surrounding 
pixels by a small threshold in case of minimums and greater 
than the average of the surrounding pixels. by a threshold in 
case of maximums. 

The process described above usually detects the halftone 
area accurately. However, in images with small kanji, thin 
lines, or ladder charts, noises in the lines can trigger false 
detection of halftone dot centers. If there are enough half- 
tone peaks, the local area could be classified as halftone 
rather than text or line arts. The kanji or ladder charts would 
then be treated as high frequency halftones, thereby causing 
the kanji or ladder chart image data to be low pass filtered 
and screened. The result is the loss of sharpness in repro- 
duction. Thus, additional constraints are necessary in quali- 
fying the pixel as a halftone peak. One embodiment of the 
present invention provides an efficient way to reduce the 
number of false detections of halftones in these areas. 

To explain the present invention more clearly, the follow- 
ing example will be discussed. As shown in FIG. 13, FIG. 13 
illustrates a halftone image which is reproduced at 133 lines 
per inch and has been magnified. Using the peak detection 
process described above, the pixels detected as halftone 
peaks are shown as black dots in FIG. 14. The resuh of 
applying the simple halftone peak detection to a ladder chart 
as shown in FIG. 15, and kanji shown in FIG. 18. The false 
detection of halftone peaks are obvious from the illustrations 
of FIGS, 16 and 19 wherein these Figures represent maps of 
the detected halftone peaks. To suppress these false 
detections, it is necessary to apply additional criteria to 
qualify a pixel as a halftone peak. 

If the video signal is denoted by V,-^- wherein i is the / scan 
Une and j is the j pixel, the false detections can be suppressed 
by computing the range of the video in the horizontal, 
vertical, and diagonal directions across three or five adjacent 
pixels. For example: for 400 spi Rj:,^^„,„/»Range(V,.^._i, 
V,^, V,^.,,) and R,,^.,,;=Range(V,_,^, V,^, V,,,^-); for 600 

Range(V._,^.^„ V,.^., V,.,i^._J; and R^,„^^^/2'= Range 

The variations of the video across a few pixels along the 
direction of a line is expected to be small. If any of these 
computed video ranges is small at a pixel location, it is likely 
that a line has passed through the pixel location. These 
ranges arc all expected to be somewhat larger when the 
center pixel is a real halftone peak. Therefore, the following 
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additional condition is imposed upon the center pixel to be 
classified as a halftone peak. 

R//o.«^/>S and R^i«,P'S and Ro«««„^>S and Ro,„„-/2>S 

5 

With this additional constraint, the halftone peak detec- 
tions for the images shown in FIGS. 15 and 18 are shown in 
FIGS. 17 and 20, respectively. The false halftone peak 
detections are significantly reduced. On the other hand, for 

■'^ a real halftone image as illustrated in FIG. 13, the additional 
constraints put on the center pixel do not reduce the nmnber 
of halftone peaks detected. 
As described above, a compound document can be auto- 

j5 matically segmented into areas of different classes. Several 
of the classifiers (microclassifiers) used in this process 
depend on the video of neighboring pixels. Since these 
classifiers (microclassifiers) depend on the video of neigh- 
boring pixels, these classifiers (microclassifiers) are very 

20 sensitive to the resolution of the actual scanned in document. 
For example, the common scanning resolutions are 400 
spots per inch and 600 spots per inch. If a classifier 
(microclassifier) is built to correspond to a 400 spot per inch 
resolution, the classifier (microclassifier) may not provide a 

25 correct or optimal output if the image being fed through the 
classifier is 600 spots per inch. 

Therefore, it is desirable that the classifiers 
(microclassifiers) are implemented such that the classifiers 
(microclassifiers) are made less sensitive to the actual reso- 
lution of the video signal being processed. One embodiment 
of the present invention realizes this desirability by adjusting 
the size of the context or number of neighboring pixels 
which will be utiUzed by the classifier (microclassifier) in 

35 proportion to the scanning resolution of the image. By 
adjusting the size of the context in proportion to the scanning 
resolution of the image, the computed classifier 
(microclassifier) can be made less sensitive to the actual 
resolution. More specifically, the classifier (microclassifier) 

^ of the present invention has the built in flexibihty such that 
the combination of these resolutions in either fastscan or 
slowscan directions can be readily accommodated by merely 
adjusting the size of the context in which the classifier 
(microclassifier) operates upon. 

For example, in a halftone peak detection circuit, the 
halftone area is detected by the presence of enough halftone 
dots in a window of known size. The halftone dots are 
identified by the local minimum of the video in the mid-tone 

50 to highhght area and the local maximum of the video in the 
mid-tone to shadow area. If the video signal is denoted by 
V,.j for the j-th pixel in the i-th scanline, one of the necessary 
conditions for the local video minimum is as follows: 

55 ^i-ij-i^^ij and V,.i^<V,^. and V,_,,.,, SV^. and V,^..,<V,^ and 

V^^^i^Vy and V;,,j.,<V,j and V,,,j<V^^ and V(,,^,,<:V,j 

For a resolution of 400 spots per inch, the checking 
specified above is adequate. However, if the scanning reso- 

60 lution is increased to 600 spots per inch, the necessary 
conditions must be extended to include more pixels, namely 
V,^_2<V,j and V.j^^^y.j for the fastscan direction and 
V._2^<V,^. and V.^2>= ^.v slowscan direction. For the 
local maximum of the video similar conditions must be also 

65 extended wherein the less than and less than and equal are 
replaced by greater than and greater than or equal, respec- 
tively. 
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Furthermore, in order to reduce the false peak detection in 
areas of kanji, thin lines and ladder charts, as discussed 
above, the following conditions are also analyzed wherein S 
is a small threshold. 

5 

Rangc(Vy.„V,j,V,^,i)>S and Rangc(V,_i^V,j,V,,,j)>S 

As noted before, if the image has a resolution of 400 spots 
per inch, the conditions set forth above are adequate. 
However, if the image has a resolution of 600 spots per inch, 
it is also necessary to include more pixels in this condition, 
namely Range(V,.^.,2, V,v_a' and Range 

(V,-2^-, V,.^., V,-^3^, V,^2j.) for the higher resolution in 
the fastscan and slowscan directions, respectively. 

As noted above, the number of local video minimums and 
maximums in a window of known size can be used to 
estimate the halftone frequency. In actual implementation, a 
double convolution using a window of uniform weights is 
applied to 1 bit per pixel bitmap of halftone peaks. This is 
equivalent to sampling the peak counts with a window of 
twice the size with a triangular weighting function. The size 
of the window for the convolution can be 8, 10, 12, 15, or 
18 pixels in a fastscan direction and 8, 10, 12, 15 or 18 
scanlines in the slowscan direction. Typically, if the scan- 
ning resolution of the image is 400 spots per inch, the 
window size for this implementation is usually 10 pixels in 
the fastscan direction and 10 scanlines in the slowscan 
direction. On the other hand, if the image has a resolution of 
600 spots per inch, the typical window is 15 pixels in the 
fastscan direction and 15 scanlines in the slowscan direction. 
Thus, as noted above, to properly detect the halftone peak 
counts, the context of the window being analyzed must be 
adjusted according to the resolution of the incoming image. 

To characterize continuous tone verses halftone, text and 
images, a measure of energy or variation of the video is ^5 
computed. In this implementation, the Laplacian of the 
video is first computed as follows; 
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More specifically, the Laplacian is simply the difference 
between a pixel and the average of its eight neighbors. After 
computing the Laplacian, the sum of the absolute values of 
the Laplacians over a small neighborhood of pixels by 
Ny scanlines surrounding the pixel of interest is computed. 45 
The parameters is chosen to be 7 for a 400 spot per inch 
image and 11 for a 600 spot per inch image. The parameters 
Ny are chosen to be 3 for a 400 spot per inch image and 5 
for a 600 spot per inch image. A pixel is classified as either 
a smooth contone, rough contone, or edge class depending 50 
on its value of the absolute sum. Large absolute Laplacian 
sums are also a necessary condition for a pixel to be 
classified as a halftone. 

Lastly, it is also desirable to detect line, text, or edges 
inside a halftone image or tint. Such a detection can be 55 
realized by first calculating a local average of the video over 
a suitable chosen context wherein the local average is 
computed for each pixel. Next the range of the computed 
averages over a small neighborhood is checked. If the range 
exceeds a certain threshold, it is likely that the pixel is an 60 
edge pixel. For high frequency halftone tint, the context for 
averaging the video is three pixels or scanlines for an image 
at 400 spots per inch revolution and 5 pixels or scanlines for 
an image at 600 spots per inch resolution in order to 
effectively smooth out the halftone tint area. 65 

As readily seen above, the various classifiers utilized in an 
auto-segmentation process requires the context of the win- 



dow being analyzed to vary from resolution to resolution of 
the input image. Therefore, the present invention adjusts the 
context of the window based on the resolution of the input 
image. In other words, the implementation of the automatic 
image segmentation process has the built in flexibility to 
effectively analyze the image at any input resolution wherein 
this is accomplished by varying the context required for the 
classifiers (microclassifiers) that the determine the segmen- 
tation in proportion to the input resolution. 

An example of such a system is illustrated in FIG. 12. As 
illustrated in FIG. 12, the video signal or image data is fed 
into a microclassifier system 100, a macro-reduction system 
200, and a pixel classification look-up table 300. The micro- 
classifier system 100 is made of a plurality of microclassifier 
circuits 101-104. These microclassifier circuits measure 
intrinsic properties of the video extracted through a simple 
mathematical formula or heuristic. The microclassification 
values are fed into the macro-reduction system 200. The 
microclassifier system 100 also receives the resolution of the 
incoming video signal. This information is used by the 
microclassifier circuits to conform their calculations and 
measurements to the actual resolution of the video signal. 

The macro-reduction system 200 is made of a plurality of 
macro-reduction circuits 201-204. Each macro-reduction 
circuit reduces microclassification values received from the 
microclassifiers to produce higher level, more directly useful 
information through mathematical operations and heuristics, 
suppressing noise and other irrelevant or undesirable varia- 
tions in the microclassifiers. The macro-reduced values are 
fed into the pixel classification look-up table 300. To the 
extent to which the expected microclassifier values are 
impacted by the resolution driven adjustment of their 
calculation, the macro-reduction parameters must be 
adjusted by either a control processor or adapted accordingly 
by the system 

The pixel classification look-up table 300 is a program- 
mable look-up table reduces the macro -reduced outputs to 
the final classification in a very fast manner. The look-up 
table 300 also enables any arbitrarily complex fimction to be 
performed, since the arbitrarily complex function can be 
pre-calculated off-line and loaded into the table. Microclas- 
sification values or even the input video may also be used 
directly by the programmable table 300 if appropriate. 

Document processing systems often segment images into 
different classifications, such as contone, text, and halftone 
in order to optimize the processing and rendering for each 
classification. All continuous tone images areas are normally 
classified the same, resulting in the same processing and 
rendering being applied. Continuous tone classifications can 
be expanded to indicate how rough or smooth each area, or 
the area surrounding each pixel, is. This enables more 
aggressive enhancement techniques to be applied to the 
rough areas, and noise reduction techniques to the smooth 
areas, resulting in less grainy images while retaining or even 
enhancing image sharpness. 

As noted above, document processing systems often need 
to take an image representation with a low level or organi- 
zational strucmre, such as a simple bit or byte per pixel 
rectangular video map, and infer structure to guide further 
processing or analysis of the image. For example, in a 
reprographic system, a document is optically scanned and 
the reflectance of each pixel is converted to an 8-bit digital 
value, forming an 8-bit per pixel matrix of integers repre- 
senting the original document. Various image processing is 
then performed to enhance the image and convert it to a 
different resolution and form which can be rendered by a 
particular marking technology, such as a bi-level xero- 
graphic printer. 



07/09/2004. EAST Version: 1.4.1 



us 6,3' 

39 

The purpose of this processing is lo produce an output 
copy which appears to the human eye to be equal to or in 
some sense better than the original. Uafortunately, the 
appropriate image processing is different for different types 
of input documents, typical broad classes being text, con- 
tinuous tone, and halftones of various techniques and fre- 
quencies. Furthermore, many user originals contain more 
than one document type in different areas of the page, for 
example a magazine article with inset halftoned 
photographs, or text embedded in a tinted background. This 
explains the need to classify each document pixel, so that 
appropriate processing can be performed. In other 
applications, the pixel classification determined may be 
simply communicated to a subsequent process which ana- 
lyzes the pixel level classification map further. 

Conventionally, all areas of a continuous tone image are 
normally assigned the same classification and processed and 
rendered identically. Typically processing consists of an 
enhancement filter to compensate for loss of sharpness due 
to scanner MTF. Unfortunately, the enhancement filter tends 
to amplify the noise, but having a little noise is much more 
forgivable by the observer than a fuzzy image, so the image 
is enhanced. Rendering may be, for example, error diffusion, 
a periodic screen, or a hybrid of the two. Typically the 
rendering for a continuous tone area is chosen to be as low 
noise as possible (a reduction in graininess is always a 
noticeable improvement). 

Given an appropriate segmentation architecture such as 
the one described above, areas of the image which need to 
be enhanced can be detected with a microclassifier such as 
Sij which delects a gradient in the video. The filler can be 
adjusted in a fuzzy, stepwise continuous fashion based on 
detected level (the detected level being the results of the Sij 
value being compared with a predetermined threshold). This 
adjustment can enable enhancing of most those areas in 
which the video is changing most rapidly and lowpass 
filtering of those areas where the video is flat, thereby 
reducing noise in the video without degrading any image 
features. This results in a "better** image, since the resulting 
image will have less noise in the smooth flat areas, yet will 
be sharp due to the enhancement of image features in the 
areas with a gradient. The noise in the enhanced areas will 
not be perceived due to the masking effect from the video 
transitions. 

The auto-segmentation pipeline, as described above, pro- 
duces all classifications (text, halftone, contone, etc.) and 
uses fuzzy error diffusion/dot screen hybrid rendering to 
minimize the visibility of misclassification artifacts. 
Because error diffusion is not a low noise rendering, this one 
size fits all segmentation/rendering scheme is not the most 
effective environment in which to apply this technique. 
However, according to the concepts of the present invention, 
a special photo mode, selected either manually or based on 
histogram information gathered during a prescan, could 
configure the classification look-up table, the image 
processing, and the rendering scheme optimally. The clas- 
sification look-up table, which would be programmable at 
runtime, would be programmed to produce only smooth and 
rough contone classifications, eliminating any artifacts due 
to spurious text or halftone classification errors. The filter, 
image processing, could be controlled on a pixel-by-pixel 
basis, and a low noise rendering such as a pure dot screen 
could be applied. 

Note that the prescan could be premeditated or elected 
after the initial scan if the system is capable of discarding the 
page just scanned and rescanning it. This would result in a 
productivity improvement since only pages that would ben- 
efit from this special photo-mode would incur a prescan. 
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To realize this feature, the auto-segmentation circuit 
would include a comparator that would compare the Sij 
value with a predetermined threshold. Depending on this 
comparison, the comparator would output a signal indicating 

5 a rough contone or a signal indicating a smooth contone. 
In operation, the present invention determines a continu- 
ous tone classification type of a pixel of image data in a 
window of pixels by classifying the pixel as representing a 
continuous tone; detecting a gradient in the window of 

10 pixels of which the pixel is a member, and classifying the 
pixel as a rough continuous tone when a gradient indicating 
that image data within the window of pixels is changing at 
a rate greater than a predetermined threshold is detected. The 
present invention classifies the pixel as a smooth continuous 

IS tone when a gradient indicating that image data within the 
window of pixels is changing at a rate less than the prede- 
termined threshold is detected. This continuous classifica- 
tion can be either realized manually through a user interface 
or automatically based on histogram information. The gra- 

20 dient can be, for example, measured based on a sum of the 
Laplacians for pixels within the window. 

The present invention also includes a filter to lowpass the 
image data of the pixel tone when a gradient indicating that 
image data within the window of pixels is changing at a rate 

25 less than a predetermined threshold is detected. The filter 
also enhances the image data of the pixel tone when a 
gradient indicating that image data within the window of 
pixels is changing at a rate greater than the predetermined 
threshold. 

30 Although the present invention has been described in 
detail above, various modifications can be implemented 
without departing from the spirit of the present invention. 
For example, the preferred embodiment of the present 
invention has been described with respect to a xerographic 

35 printing system; however, these fuzzy methods and filters 
are readily implemented in a thermal Inkjet system, a display 
system, or other image processing system 

Moreover, the image processing system of the present 
invention can be readily implemented on a general purpose 

40 computer, a personal computer or workstation. The image 
processing system of the present invention can be readily 
implemented on an ASIC, thereby enabling the placement of 
this process in a scanner, electronic subsystem, printer, or 
display device. 

45 The present invention has been described with respect to 
a video range of 0 to 255. However, it is contemplated by the 
present invention that the video range can be any suitable 
range to describe the grey level of the pixel being processed. 
Furthermore, the present invention is readily applicable to 

50 any image processing system, not necessarily a binary 
output device. It is contemplated that the concepts of the 
present invention are readily applicable to a four-level 
output terminal or higher. 

Also, the present invention has been described, with 

55 respect to the fuzzy classification and fuzzy processing 
routines, that the scalar values are determined using the 
weighted sum of the centriod method since the centriods in 
the preferred embodiment are non-overiapping (the classes 
are non-overlapping). However, the present invention is 

60 readily applicable to a system with overlapping classes. 
Such an extension is readily known to those skilled in the art 
of fuzzy logic. 

Lastly, the present invention has been described with 
respect to a monochrome or black/white environment. 

65 However, the concepts of the present invention are readily 
applicable to a color environment. Namely, the processing 
operations of the present invention can be applied to each 
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color space value, some function of a given pixel's color 
space components, or even a function of the color space 
components of a pixel and other pixels in the neighborhood. 

While the invention has been described with reference to 
various embodiments disclosed above, it is not confined to 
the details set forth above, but is intended to cover such 
modifications or changes as may come within the scope of 
the attached claims. 

What is claimed is: 

1. A method for determining a continuous lone classifi- 
cation type of a pixel of image data in a window of pixels, 
comprising the steps of: 

(a) classifying the pixel as representing a continuous tone; 

(b) detecting a gradient in the window of pixels of which 
the pixel is a member; and 

(c) classifying the pixel as a rough continuous tone when 
said step (b) detects a gradient indicating that image 
data within the window of pixels is changing at a rate 
greater than a predetermined threshold. 

2. The method as claimed in claim 1, further comprising 
the step of: 

(d) classifying the pixel as a smooth continuous tone 
when said step (b) detects a gradient indicating that 
image data within the window of pixels is changing at 
a rate less than the predetermined threshold. 

3. The method as claimed in claim 1, wherein said step (a) 
classifies the pixel manually through a user interface. 

4. The method as claimed in claim 1, wherein said step (a) 
classifies the pixel automatically based on histogram infor- 
mation. 

5. The method as claimed in claim 2, wherein said step (a) 
classifies the pixel manually through a user interface. 

6. The method as claimed in claim 2, wherein said step (a) 
classifies the pixel automatically based on histogram infor- 
mation. 

7. The method a claimed in claim 1, wherein said gradient 
is measured based on a sum of the Laplacians for pixels 
within the window. 

8. The method a claimed in claim 2, wherein said gradient 
is measured based on a sum of the Laplacians for pixels 
within the window. 

9. A system for determining a continuous tone classifica- 
tion type of a pixel of image data in a window of pixels, 
comprising: 

an auto-segmentation circuit to enable classifying a pixel 
as a continuous tone; and 

means for detecting a gradient in the window of pixels of 
which the pixel is a member; 

said aulo-segmentation circuit classifying the pixel as a 
rough continuous tone when a gradient indicating that 
image data within the window of pixels is changing at 
a rate greater than a predetermined threshold is 
detected. 

10. The system as claimed in claim 9, wherein said 
auto-segmentation circuit classifies the pixel as a smooth 
continuous tone when a gradient indicating that image data 
within the window of pixels is changing at a rate less than 
a predetermined threshold is detected. 

11. The system as claimed in claim 9, further comprising: 
an user interface which enables an user to classify the 

pixel manually. 

12. The system as claimed in claim 9, wherein said 
auto-segmentation circuit classifies the pixel automatically 
based on histogram information. 
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13. The system as claimed in claim 10, further compris- 
ing: 

an user interface which enables an user to classify the 
pixel manually. 

5 14. The system as claimed in claim 10, wherein said 
auto-segmentation circuit classifies the pixel automatically 
based on histogram information. 

15. The system a claimed in claim 9, wherein said 
gradient is measured based on a sum of the Laplacians for 

10 pixels within the window. 

16. The system a claimed in claim 10, wherein said 
gradient is measured based on a smn of the Laplacians for 
pixels within the window. 

17. A rendering system for processing a pixel of image 
15 data in a window of pixels based on a continuous tone 

classification type, comprising: 

an auto-segmentation circuit to enable classifying a pixel 

as a continuous tone; 
means for detecting a gradient in the window of pixels of 
20 which the pixel is a member; and 

a filter to enhance the image data of the pixel tone when 
a gradient indicating that image data within the window 
of pixels is changing at a rate greater than a predeter- 
mined threshold is detected. 

18. The rendering system as claimed in claim 17, wherein 
said filter low-passes the image data of the pixel tone when 
a gradient indicating that image data within the window of 
pixels is changing at a rate less than a predetermined 
threshold is detected, 

19. The rendering system as claimed in claim 17, further 
comprising: 

an user interface which enables an user to classify the 
pixel manually. 

20. The rendering system as claimed in claim 18, wherein 
said auto-segmentation circuit classifies the pixel automati- 
cally based on histogram information. 

21. The rendering system as claimed in claim 17, further 
comprising: 

an user interface which enables an user to classify the 
pixel manually. 

22. The rendering system as claimed in claim 18, wherein 
said auto-segmentation circuit classifies the pixel automati- 
cally based on histogram information. 

23. The rendering system a claimed in claim 17, wherein 
said gradient is measured based on a sum of the Laplacians 
for pixels within the window. 

24. The rendering system a claimed in claim 18, wherein 
said gradient is measured based on a sum of the Laplacians 
for pixels within the window. 

25. A method for determining a continuous tone classifi- 
cation type of a pixel of image data in a window of pixels, 
comprising the steps of: 

(a) classifying the pixel as representing a continuous tone; 
55 (b) detecting a gradient in the window of pixels of which 
the pixel is a member; and 
(c) classifying the pixel as a smooth continuous tone when 
said step (b) detects a gradient indicating that image 
data within the window of pixels is changing at a rate 
50 less than a predetermined threshold. 

26. A system for determining a continuous tone classifi- 
cation type of a pixel of image data in a window of pixels, 
comprising: 

an auto-segmenlation circuit to enable classifying a pixel 
65 as a continuous tone; and 

means for detecting a gradient in the window of pixels of 
which the pixel is a member. 
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wherein said auto-segmentation circuit classifies the pixel 
as a smooth continuous tone when a gradient indicating 
that image data within the window of pixels is changing 
at a rate less than a predetermined threshold is detected. 
27. A rendering system for processing a pixel of image 
data in a window of pixels based on a continuous tone 
classification type, comprising: 

an auto-segmentation circuit to enable classifying a pixel 
as a continuous tone; 
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means for detecting a gradient in the window of pixels of 

which the pixel is a member, and 
a filter to lowpass the image data of the pixel lone when 

a gradient indicating that image data within the window 

of pixels is changing at a rate less than a predetermined 

threshold is detected. 
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